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10 Throughout this application various publications are referenced. The disclosures of these 
publications in their entireties are hereby incorporated by reference into this application in 
order to more folly describe the state of the art to which this invention pertains. 
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FIELD OF THE INVENTION 

The present invention relates generally to nucleotide sequences, and polypeptides 
encoded by the sequences, that are essential for bacterial viability, and to methods of 
using the nucleotide and polypeptide sequences, 

20 BACKGROUND OF THE INVENTION 

Bacterial genera, such as Streptococcus, Staphylococcus, Pseudomonas, Yersinia, 
Salmonella, and Enter obacter, are the cause of numerous afflictions in humans and 
animals. Bacterial infection can lead to serious health conditions, including pneumonia, 
25 osteomyelitis, meningitis, sinusitis, otitis, cystitis, and even food poisoning. .Typically,' 
these infections can be treated with standard antimicrobial agents such as antibiotics' 
However, the emergence of pathogenic bacterial strains that are resistant to antibiotics 
has risen alarmingly in the past two decades. This situation has created an urgent need 
for the development of new antimicrobial agents. 

30 

One strategy for developing new antimicrobial agents is to identify bacterial gene 
sequences that encode gene products that are essential for bacterial cell viability and 
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develop and/or identify agents which inhibit the function of the gene product. DNA 
sequencing technology has advanced from sequencing one gene at a time to sequencing 
entire genomes, the sum of all genes in an organism. With the recent arrival of bacterial 
genomic information, it is now possible to compare multiple bacterial genomes in an 
attempt to identify genes that encode conserved gene products. In this manner, one 
skilled in the art may identify a set of conserved bacterial genes, including a subset of 
genes that are essential for bacterial cell viability. The essential gene is then used as a 
starting point to develop therapeutic agents that inhibit or inactivate the product of the 
essential gene. 

The availability of DNA sequence information for multiple microbial genomes is a recent 
development. The public release of the first complete genome, Haemophilus influenzae 
(Fleischmann, R.D., et al. 1995 Science 269:496-512 ), Was followed in rapid succession 
by a number of public and private genome sequencing programs. Presently, some 20 
completely sequenced bacterial genomes have been published, and over 100 other 
sequencing projects are underway (Blattner, F.R., et al., 1997 Science 277:1453-74; 
Ferretti, J.J., et al., 1997 Adv Exp Med Biol 418:961-963; Koonin, E.V., et al., 1996 . 
Methods Enzymol 266:295-322). Analyses of these data indicate that approximately 
46% of putative bacterial genes are of unknown function having no attributable function. 

Others have pursued various strategies to identify bacterial genes that are essential for 
viability. These strategies include: identifying genes that are expressed by the bacteria 
when present in the infected host (Hensel, M, et al., 1995 Science 269:400-3), 
identifying essential genes by isolating temperature sensitive mutants (Schmid, M.B., et 
al., 1998 Curr Opin Chem Biol 2:529-34), and identifying genes in pathways known 
from prior physiological studies to be essential (Skarzynski, T. et al., 1996 Structure 
1996 4:1465-74) 

There continues to be a need to identify bacterial genes that encode gene products that are 
30 essential for ceil viability, such as cell replication, growth, and survival. These genes and 
their encoded gene products can be used as a starting point towards identifying agents 
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that inhibit functions essential for cell viability, thereby causing bacterial cell stasis or 
death (e.g., antibacterial agents). 

The present invention provides experimental identification of novel, conserved essential 
5 genes (ceg) from bacteria and their encoded protein products. The ceg genes are 
considered essential to cell viability because disruption of an endogenous ceg gene results 
in lethality of a bacterial cell (e.g., as determined by failure to recover viable 
chloramphenicol-resistant colonies, as described herein). . Thus, the gene products 
encoded by these genes are potentially valuable targets for chemotherapeutic intervention 
10 of bacterial infections. 

The ceg nucleotide sequences of the invention were obtained by large-scale 
computational comparisons of multiple genome sequences to identity conserved protein 
coding regions, followed by gene disruption to identify cegs. The conservation of protein 
15 sequences in many cases is believed to reflect the higher level conservation of common 
biochemical pathways essential for bacterial function and viability. 

SUMMARY OF THE INVENTION 

20 The acronyms "CEG" and "ceg" stand for Conserved Essential Gene. For convenience, 
the italicized term ceg refers herein to ceg nucleotide sequences. The capitalized term CEG 
refers herein to CEG polypeptide sequences. 

Embodiments of the ceg nucleotide sequences and the CEG polypeptide sequences are 
25 designated CFEs which stands for CEG For Expression. The CFEs are polypeptides 
resulting from expression of the ceg nucleotide sequence. 

The .present invention provides isolated nucleotide sequences of conserved essential 
genes from bacteria, designated ceg. The invention also provides recombinant nucleic 
30 acid molecules including the ceg sequences of the invention, and methods of uses thereof. 
Examples of nucleic acid molecules having ceg sequences are described in SEQ ID 
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NOS, 1-113. The invention farther provides isolated polypeptides and recombinant 
polypeptides having the CEG sequences of the invention, and methods of uses thereof 
Examples of polypeptides having CEG sequences are described in SEQ ID NOS -114 
226. 

The ceg sequences of the present invention are DNA or KNA. Further, the invention 
includes nucleic acid molecules that are identical or nearly identical (e.g., similar) with 
the ceg sequences of the invention. The invention additionally provides polynucleotide 
sequences that hybridize under stringent conditions to the ceg sequences of the invention 
A further embodiment provides polynucleotide sequences which are complementary to 
the ceg sequences of the invention. Yet another embodiment provides ceg nucleic acid 
molecules that are labeled with a detectable marker. Another embodiment provides 
recombinant nucleic acid molecules, such as a vector or a fusion molecule, including the 
ceg sequences of the invention. 

The present -invention provides various ceg sequences, fragments thereof having essential 
gene activity, and related molecules such as antisense molecules, oligonucleotides 
peptide nucleic acids (PNA), fragments, and portions thereof, 

20 The present invention relates to the inclusion of the polynucleotides encoding CEG gene 
products, such as CEG polypeptides, in an expression vector which can be used to 
transform host cells or organisms. Such transgenic hosts are useful for the production of . 
CEG gene products for the development of antibacterial agents such as antibiotics. 

25 The invention further provides substantially purified CEG gene products, and uses 
thereof. 

The invention also relates to pharmaceutical compositions comprising antisense 
molecules capable of disrupting expression of ceg sequences, agonists, antagonists or 
30 inhibitors of CEG gene products, and antibodies reactive against the CEG polypeptides. 
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These compositions are useful for preventing the growth or survival of bacteria, for 
example, in the treatment of conditions associated with bacterial infections. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1: A schematic representation of the gene disruption assay, as described in Example 
3, infra. A) A recombinant vector undergoing homologous recombination with the host 
genome. B) The result of homologous recombination. 
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Figure 2: A schematic representation of the polarity test for operons, as described in 
Examples 2 and 3, infra. A) The recombinant vector undergoing homologous 
recombination with the host genome. B) Case 1: one possible result of homologous 
recombination; the downstream Gene B has an independent promoter. C) Case 2; another 
possible result of homologous recombination; the downstream Gene B does not have an 
15 independent promoter. 

Figure 3: Purification of 2CFE 75, as described in Example 6, infra. A) Fractionation 
profile of 2CFE 75 eluted from a Ni-NTA column. B) Gel electrophoresis of pooled 
fractions of CFE 75. C) Non-denaturing gel electrophoresis to determine oligo form of 
20 2CFE75. 

Figure 4: Fractionation profile of 2CFE 3 eluted from a hydroxyapatite column, as described 
in Example 7, infra. 

25 Figure 5: The biosynthesis pathway of Coenzyme A which starts with phosphorylation of 
pantothenate. 

Figure 6: Circular dichroism spectra of 2CFE 101 and 103, as described in Example 10, 
infra. A) Circular dichroism spectra of 2CFE 101 and 103 at 25 degrees C. B) Circular 
30 dichroism thermal melt spectra of 2CFE 101 and 103 at a range of zero to 100 degrees C. 
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Figure 7: Circular dichroism spectra of aggregate and monomer pools of 2CFE 101 and 103, 
as described in Example 10. infra. A) Circular dichroism spectra of aggregate and monomer 
pools of 2CFE 101 and 103 at 25 degrees C. B) Circular dichroism thermal melt spectra of 
aggregate and monomer pools of 2CFE 101 and 103 at a range of zero to 100 degrees C. 

Figure 8: Absorbance spectra of pantothenate-dependent production of ADP, as described in 
Example 10, infra. - 

Figure 9: The results of size exclusion chromatography and gel electrophoresis showing the 
oligomeric forms of 2CFE 21 and 39, as described in Example 1 1, infra. Lanes 1-6 contain 
2CFE 21, lane 7 is a molecular weight marker, lanes 8-10 contain 2CFE 39. 



Figure 10: Gel electrophoresis of a helicase reaction using 2CFE 21 and 39 and radiolabeled 
synthetic Holliday Junction template, as described in Example 11, infra. Lane 1 contains 

15 the synthetic Holliday Junction template; lane 2 contains the synthetic duplex; lane 3 
contains a single-stranded template; lane 4 contains the helicase reaction using 2CFE 39; 
lane 5 contains the helicase reaction using 2CFE 21; lanes 6-8 contain the helicase reaction 
using 2CFE 39 and 21 at varying concentrations (e.g., 1, 2, and 3 uM each); and lane 9 
contains the helicase reaction using 2 uM each 2CFE 39 and 21 in the presence of ethidium 

20 bromide. 

Figure 1 1: A graph depicting the results of the helicase reaction which were monitored by 
measuring the unquenching of the Holliday Junction templates with time, as described in 
Example 1.1, infra. 
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Figure 12: Capillary electrophoresis results, of 2CFE 8 with and without ssDNA, as 
described in Example 12, infra. A) Electropherogram of 2CFE 8 alone: B) 
Electropherogram of 2CFE 8 in the presence of a 32-nucleotide single-stranded oligomer. 

.30 Figure 13: Gel mobility shift assay of 2CFE 8, and 2CFE 8 in the presence of a single- 
stranded 32-mer, as described in Example 12, infra. A) An ethidium bromide-stained, 
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native, polyacrylamide gel containing 2CFE 8, and 2CFE 8 in the presence of a 32-mer. B) 
The same native, polyacrylamide gel stained with Coomassie. 

Figure 14: The N-acetyl glucosamine pathway putatively mediated by 2CFE 3 and 2CFE 
86, as described in Example 13, infra. 



Figure 15: Capillary electrophoresis results of 2CFE 3 with and without putative substrates, 
as described in Example 13, infra.. A) Electropherogram of 2CFE 3 with .and without 
glucosamine-l-phosphate. B) Electropherogram of 2CFE 3 with and without D-glucose-1- 
10 phosphate. C) Electropherogram of 2CFE 3 alone, 2CFE 3 and glucose- 1 -phosphate, and 
2CFE 3 and glucose-6-phosphate. D) Electropherogram of 2CFE 3 alone or in the presence 
of glucosamine-l-phosphate, glucosamine-6-phosphate, D-glucose, D(+) galactose, and a- 
D-glucose-1 -phosphate. 

15 Figure 16: Capillary electrophoresis results of FITC-derivitized 2CFE 3 polypeptide with 
and without D-gIucosamine-6-phosphate (substrate) to produce the product D-glucosamine- 
1-phosphate, using laser-induced fluorescence, as described in Example 13, infra. 
Electropherogram of D-gIucosamine-6-phosphate (putative substrate), 2CFE 3 reacted with 
D-glucosamine-6-phosphate, and the product glucosamme-l-phhosphate. 
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Figure 17: Gel electrophoresis of 2CFE 86 eluted from an Ni-NTA column, as described in 
Example 13, infra. 

Figure 18: HPLC analysis of a coupled reaction including 2CFE 3, 2CFE 86, and D-' 
glucosamine-6-phosphate to produce the product, UDP-N-acetylglucosamine-l-phosphate 
(UDPAG), as described in Example 1 3, infra. 

Figure 1 9: A fatty acid biosynthesis pathway. 
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Figure 20: Size exclusion chromatography to determine the molecular weight and 
oligomer* form of 2CFE 34, as described in Example 1.4, infra.. Selected eluted samples 
were sized by gel electrophoresis. 

Figure 21: Gel electrophoresis of 2CFE 41 eluted from a Ni-NTA column, as described in 
Example 15, infra. 



Figure 22: Capillary electrophoresis results of 2CFE 40, 41, and 46, as described in Example 
15, infra. 

10 

Figure 23: Depicts a schematic diagram of a ligand which binds 2CFE 34. The ligand is 2- 
phenyl-N-(3 corboxyMhydroxyphenyl) azabicyclo [4.3.0] riona-2, 8-diene. 

Figure 24: Depicts a schematic diagram of a ligand which binds 2CFE 43. The ligand is N- 
15 (3, 5-dinitrobenzyl)-7-trifluoromethyl benza diaza ruranolactone. 

Figure 25: Depicts a schematic diagram of a ligand which binds 2CFE 43. The ligand is 2- 
amino (N-para-methylphenyl sulfonamide)-3-phenylpropianic acid. 

20 Figure 26: A nucleic acid sequence of 2CFE1 deposited with the American Type Culture 
Collection as ATCC designation. on December 20, 2000. 

Figure 27: A nucleic acid sequence of 2CFE2 deposited with the American Type Culture 
Collection as ATCC designation . on December 20, 2000. 

25 

Figure 28: A nucleic acid sequence of 2CFE3 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 29: A nucleic acid sequence of 2CFE4 deposited with the American Type Culture 
30 Collection as ATCC designation 0 n December 20, 2000. 
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Figure 30: A nucleic acid sequence of 2CFE5 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

5 Figure 31: A nucleic acid sequence of 2CFE6 deposited with the American Type Culture 
Collection as ATCC designation • 0 n December 20, 2000. 

Figure 32: A nucleic acid sequence of 2CFE7 deposited with the American Type Culture 
Collection as ATCC designation __ on December 20, 2000. 

10 

Figure 33: A nucleic acid sequence of 2CFE8 deposited with the American Type Culture 
Collection as ATCC designation __ on December 20, 2000. 

Figure 34: A nucleic acid sequence of 2CFE9 deposited with the American Type Culture 
1 5 Collection as ATCC designation 0 n December 20, 2000. 

Figure 35: A nucleic acid sequence of 2CFE10 deposited with the American Type Culture 
Collection as ATCC designation _____ on December 20, 2000. 

0 Figure 36: A nucleic acid sequence of 2CFE11 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 37: A nucleic acid sequence of 2CFE12 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 38: A nucleic acid sequence of 2CFEI3 deposited with the American Type Culture 
Collection as ATCC designation 0 n December 20, 2000. 

Figure 39: A nucleic acid sequence of 2CFE14 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 
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Figure 40: A nucleic acid sequence of 2CFE15 deposited with the American Type Culture 
Collection as ATCC designation 0 n December 20, 2000. 

Figure 41 : A nucleic acid sequence of 2CFE16 deposited with the American Type Culture 
5 Collection as ATCC designation 0 n December 20, 2000. 

Figure 42: A nucleic acid sequence of 2CFE17 deposited with the American Type Culture 
Collection as ATCC designation 0 n December 20, 2000. 

10 Figure 43: A nucleic acid sequence of 2CFE19 deposited with the American Type Culture • 
Collection as ATCC designation 0 n December 20, 2000. 

Figure 44: A nucleic acid sequence of 2CFE21 deposited with the American Type Culture 
Collection as ATCC designation 0 n December 20, 2000. 

15 

Figure 45: A nucleic acid sequence of 2CFE24 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 46: A nucleic acid sequence of 2CFE25 deposited with the American Type Culture 
10 Collection as ATCC designation 0 n December 20, 2000. 

Figure 47: A nucleic acid sequence of 2CFE26 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

5 Figure 48: A nucleic acid sequence of 2CFE27 deposited with the American Type Culture 
Collection as ATCC designation - on December 20, 2000. 

Figure 49: A nucleic acid sequence of 2CFE28 deposited with the American Type Culture 
Collection as ATCC designation 0 n December 20, 2000. 
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Figure 50: A nucleic acid sequence of 2CFE29 deposited with the American Type Culture 
Collection as ATCC designation 0Q December 20, 2000. 

Figure 51: A nucleic acid sequence of 2CFE30 deposited with the American Type Culture 
5 Collection as ATCC designation on December 20, 2000. 

Figure 52: A nucleic acid sequence of 2CFE31 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

10 Figure 53: A nucleic acid sequence of 2CFE32 deposited with the American Type Culture 
Collection as ATCC designation 0 n December 20, 2000, 

Figure 54: A nucleic acid sequence of 2CFE33 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

15 

Figure 55: A nucleic acid sequence of 2CFE34 deposited with the American Type Culture 
Collection as ATCC designation 0 n December 20, 2000. 

Figure 56: A nucleic acid sequence of 2CFE35 deposited with the American Type Culture 
20 Collection as ATCC designation on December 20, 2000. 

Figure 57: A nucleic acid sequence of 2CFE36 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

25 Figure 58: A nucleic acid sequence of 2CFE37 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 59: A nucleic acid sequence of 2CFE38 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

30 
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Figure 60: A nucleic acid sequence of 2CFE39 deposited with the American Type Culture 
Collection as ATCC designation ______ on December 20, 2000. 

Figure 61: A nucleic acid sequence of 2CFE40 deposited with the American Type Culture 
5 Collection as ATCC designation on December 20, 2000. 

Figure 62: A nucleic acid sequence of 2CFE41 deposited with the American Type Culture 
Collection as ATCC designation • on December 20, 2000. 

10 Figure 63: A nucleic acid sequence of 2CFE42 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 64: A nucleic acid sequence of 2CFE43 deposited with the American Type Culture 
Collection as ATCC designation , 0 n December 20, 2000. 

15 

Figure 65: A nucleic acid sequence of 2CFE44 deposited with the American Type Culture 
. Collection as ATCC designation 0 n December 20, 2000. 

Figure 66: A nucleic acid sequence of 2CFE45 deposited with the American Type Culture 
20 Collection as ATCC designation on December 20, 2000. 

Figure 67: A nucleic acid sequence of 2CFE46 deposited with the American Type Culture 
Collection as ATCC designation ■ on December 20, 2000. 

25 Figure 68: A nucleic acid sequence of 2CFE47 deposited with the American Type Culture • 
Collection as ATCC designation on December 20, 2000. 

Figure 69: A nucleic acid sequence of 2CFE48 deposited with the American Type Culture 
Collection as ATCC designation . on December 20, 2000. 

30 
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Figure 70; A nucleic acid sequence of 2CFE49 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 71 : A nucleic acid sequence of 2CFE50 deposited with the American Type Culture 
5 Collection as ATCC designation on December 20, 2000. 

Figure 72: A nucleic acid sequence of 2CFE51 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

10 Figure 73: A nucleic acid sequence of 2CFE52 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 74: A nucleic acid sequence of 2CFE53 deposited with the American Type.Culture 
Collection as ATCC designation on December 20, 2000. 

15 

Figure 75: A nucleic acid sequence of 2CFE54 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 76: A nucleic acid sequence of 2CFE55 deposited with the American Type Culture 
20 Collection as ATCC designation . on December 20, 2000. 

Figure 77: A nucleic acid sequence of 2CFE56 deposited with the American Type Culture • 
Collection as ATCC designation _ on December 20, 2000. 

25 Figure 78: A nucleic acid sequence of 2CFE57 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 79: A nucleic acid sequence of 2CFE58 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

30 
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Figure 80: A nucleic acid sequence of 2CFE59 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 81: A nucleic acid sequence of 2CFE60 deposited with the American Type Culture 
5 Collection as ATCC designation on December 20, 2000. 

Figure 82: A nucleic acid sequence of 2CFE61 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

10 Figure 83: A nucleic acid sequence of 2CFE62 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 84: A nucleic acid sequence of 2CFE64 deposited with the American Type Culture 
Collection as ATCC designation 0 n December 20, 2000. 

15 

Figure 85: A nucleic acid sequence of 2CFE65 deposited with the American Type Culture 
Collection as ATCC designation on December 20,.2000. 

Figure 86: A nucleic acid sequence of 2CFE66 deposited with the American Type Culture 
20 Collection as ATCC designation on December 20, 2000. 

Figure 87: A nucleic acid sequence of 2CFE67 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

25 Figure 88: A nucleic acid sequence of 2CFE68 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 89: A nucleic acid sequence of 2CFE69 deposited with the American Type Culture 
Collection as ATCC designation 0 n December 20, 2000. 

30 
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Figure 90: A nucleic acid sequence of 2CFE70 deposited with the American Type Culture 
Collection as ATCC designation 0 n December 20, 2000. 

Figure 91: A nucleic acid sequence of 2CFE71 deposited with the American Type Culture 
5 Collection as ATCC designation on December 20, 2000. 

. Figure 92: A nucleic acid sequence of 2CFE72 deposited with the American Type Culture 
Collection as ATCC designation 0 n December 20, 2000. 

10 Figure 93: A nucleic acid sequence of 2CFE75 deposited with the American Type Culture 
Collection as ATCC designation 0 n December 20, 2000. 

Figure 94: A nucleic acid sequence of 2CFE76 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

15 

Figure 95; A nucleic acid sequence of 2CFE78 deposited with the American Type Culture 
Collection as ATCC designation _ on December 20, 2000. 

Figure 96: A nucleic acid sequence of 2CFE79 deposited with the American Type Culture 
■ 20 Collection as ATCC designation on December 20, 2000. 

Figure 97: A nucleic acid sequence of 2CFE80 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

25 Figure 98: A nucleic acid sequence of 2CFE81 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 99: A nucleic acid sequence of 2CFE82 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

30 
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Figure 1.00: A nucleic acid sequence of 2CFE83 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 101: A nucleic acid sequence of 2CFE84 deposited with the American Type Culture 
5 Collection as ATCC designation 0 n December 20, 2000. ' 

Figure 102: A nucleic acid sequence of 2CFE85 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

10 Figure 103: A nucleic acid sequence of 2CFE86 deposited with the American Type Culture 
Collection as ATCC designation . on December 20, 2000. 

Figure 104: A nucleic acid sequence of 2CFE87 deposited with the American Type Culture 
Collection as ATCC designation : on December 20, 2000. 

15 

Figure 105: A nucleic acid sequence of 2CFE88 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 106: A nucleic acid sequence of 2CFE89 deposited with the American Type Culture 
20 Collection as ATCC designation on December 20, 2000. 

Figure 107: A nucleic acid sequence of 2CFE90 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

25 Figure 108: A nucleic acid sequence of 2CFE91 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 109: A nucleic acid sequence of 2CFE92 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

30 
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Figure 110: A nucleic acid sequence of 2CFE94 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 111: A nucleic acid sequence of 2CFE95 deposited with the American Type Culture 
5 Collection as ATCC designation 0 n December 20, 2000. 

Figure 1 12: A nucleic acid sequence of 2CFE96 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

10 Figure 113: A nucleic acid sequence of 2CFE97 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 114: A nucleic acid sequence of 2CFE99 deposited with the American Type Culture 
Collection as ATCC designation ______ on December 20, 2000 

15 

. Figure 115: A nucleic acid sequence of 2CFE101 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 1 16: A nucleic acid sequence of 2CFE102 deposited with the American Type 
20 CultureCollectionasATCCdesignation^ on December 20, 2000. 

Figure 117: A nucleic acid sequence of 2CFE103. deposited with the American Type 
Culture Collection as ATCC designation ________ on December 20, 2000. 

25 Figure 118: A nucleic acid sequence of 2CFE104 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 119: A nucleic acid sequence of 2CFE105 deposited with the American Type ' 
Culture Collection as ATCC designation on December 20 2000 

30 
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. Figure 120: A nucleic acid sequence of 2CFE106 deposited with the American Type 
Culture Collection as ATCC designation on December 20> 2000 

Figure 121: A nucleic acid sequence of 2CFE107 deposited with the American Type 
5 CultureCoUectionasATCCdesignation.,^ on December 20, 2000. 

Figure 122: A nucleic acid sequence of 2CFE108 deposited with the American Type 
Culture Collection as ATCC designation 0 n December 20, 2000. 

10 Figure 123: A nucleic acid sequence of 2CFE109 deposited with the American Type 
Culture Collection as ATCC designation on December 2Q> mQ 

Figure 124: A nucleic acid sequence of 2CFE111 deposited with the American Type 
Culture Collection as ATCC designation: on December 20, 2000. 
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Figure 125: A nucleic acid sequence of 2CFE112 deposited with the American Type 
Culture Collection as ATCC designation on December 2Q> mQ 

Figure 126: A nucleic acid sequence of 2CFE113 deposited with the American Type 
20 Culture Collection as ATCC designation . on December ^ 2000 

Figure 127: A nucleic acid sequence of 2CFE114 deposited with the American Type 
Culture Collection as ATCC designation . on December 20, 2000. 

25 Figure 128: A nucleic acid sequence of 2CFE1 15 deposited with the American Type 
Culture Collection as ATCC designation 0 n December 20, 2000. 

Figure 129: A nucleic acid sequence of 2CFE1 16 deposited with the American Type 
Culture Collection as ATCC designation _______ on December 20 2000 

30 
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Figure 130: A nucleic acid sequence of 2CFE117 deposited with the American Type 
Culture Collection as ATCC designation on December 2Q> mQ 

Figure 131 : Schematic structures of alkyloids which are ligands, for example, of 2CFE42. 

DETAILED DESCRIPTTON OF THE INVFNTTniv 

Definitions 



10 All scientific and technical terms used in. this application have meanings commonly used 
the art unless otherwise specified. As used in this application; the following words 
phrases have the meanings specified. 



in 
or 
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As used herein, a ceg nucleic acid molecule is said to be "isolated" when the nucleic acid 
molecule is substantially separated from contaminant nucleic acid molecules that encode 
polypeptides other than CEGs. Additionally, isolated nucleic acid molecule refers to any 
RNA or DNA sequence obtained from a natural source, or constructed by recombinant 
methods, or synthesized. A skilled artisan can readily employ nucleic acid isolation 
procedures to obtain an isolated nucleic acid molecule having ceg sequences. 

The term "ceg" includes all isolated forms of ceg nucleotide and CEG amino acid sequences 
disclosed herein. The ceg sequences encode gene products that have essential biological 
functions in bacterial cells, such as ? for example, nucleotide biosynthesis, amino acid 
biosynthesis, DNA replication, RNA transcription, protein translation, DNA 
recombination, DNA repair, biosynthesis of cofactors (e.g., Coenzyme A), biosynthesis 
of prosthetic groups, cellular processes (e.g., chaperones, cell division, and polypeptide 
secretion), energy metabolism (e.g., pentose phosphate pathway, glycolysis, 
gluconeogenesis), fatty acid biosynthesis, cell wall biosynthesis, and/or biosynthesis of 
purines, pyridines, nucleosides, and nucleotides. Accordingly, the gene products of the 
ceg nucleotide sequences are required for viability of bacterial cells. The term "ceg" also 
includes variants having nucleotide sequence similarity to the disclosed ceg sequences, 
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including sequences isolated from various bacterial genera and species, allelic variants, 
mutant variants, and ceg variants that encode conservative and non-conservative amino acid 
substitutions. The present invention also provides for all ceg sequences generated by 
recombinant DNA technology, including complementary sequences, ceg sequences that 
5 hybridize to the sequences of the invention at high stringency hybridization conditions, 
fusion genes comprising a ceg sequence, and codon usage variants. 

The term "essential genes" refers to a nucleotide sequence that encodes a gene product 
having a function which is required for cell viability. The term "essential protein" refers 
10 to a polypeptide that is encoded by an essential gene and has a function that is required 
for cell viability. Accordingly, a mutation that disrupts the function of the essential gene 
or essential proteins results in a loss of viability of cells harboring the mutation. 

"Non-essential genes" or "non-essential proteins" refer to genomic information or the 
15 protein(s) or RNAs encoded therefrom which, when disrupted by a mutation, do not 
result in a loss of viability of cells harboring said mutation under defined laboratory 

conditions. 

As used herein, a nucleotide sequence is said to be "identical" to another reference 
20 sequence when both nucleotide sequences are exactly alike. 

As used herein, a nucleotide sequence is said to be "similar" to another reference 
sequence when a comparison of the two sequences shows that they have a low level of 
sequence differences. For example, two sequences are considered to be similar to each 
25 other when the percentage of nucleotides that are shared between the two sequences is 
between about 70 % to 99.99% over the entire length of the two sequences. 

As used herein an amino acid sequence is said to be "similar" to another reference 
sequence when a comparison of the two sequences shows that they have a low level of 
30 sequence differences. For example, two sequences are considered to be similar to each 
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other when the percentage of amino acids that are shared between the two sequences may 
be between about 30% to 1 00% identity over the entire length ofthe two sequences. 

As used herein, an "allele" or "allelic sequence" is an alternative form of the naturally- 
5 occurring ceg sequence. Alleles result from a mutation, that changes the nucleotide 
sequence, and generally produce altered mRNAs or polypeptides whose structure or 
function may or may not be altered. 

"Substantially purified" as used herein means a specific isolated nucleic acid or protein, 
10 or fragment thereof, in which substantially all contaminants (i.e. substances that differ 
from said specific molecule) have been separated from said nucleic acid or protein. 

In a host cell, an "endogenous" sequence as used herein means a nucleic acid sequence 
that is natiirally-occurring and resides within the host genome. 
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In a host cell, an "exogenous" sequence as used herein means an isolated nucleic acid 
sequence that is introduced into the host cell, using any one of a variety of introduction 
methods, such as transection, electroporation, cationic lipid or salt treatment methods. 
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"Knockout mutant" or "knockout mutation" as used herein refers to an in vitro engineered 
disruption of a region of endogenous chromosomal DNA (e.g., disruption ofthe genome), 
typically within a protein coding region. A knockout mutation can be generated by 
inserting an exogenous DNA sequence into the homologous endogenous sequence: A 
knockout mutation occurring in a protein coding region is expected to disrupt normal 
25 expression of the protein coding region. This usually leads to loss of the function 
provided by the protein. 

In order that the invention herein described may be more fully understood, the following 
description is set forth. 

0 
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1) CEG NUCLF.TC ACID MOT irrm .vs 

5 The present invention provides isolated and recombinant ceg nucleic acid molecules and 
fragments thereof, and related molecules, such as sequences complementary to ceg 
sequences or a portion thereof, and those that hybridize to the nucleic acid molecules of 
the invention. 



10 The ^polynucleotide sequences, also referred to herein as nucleic acid molecules of the 
invention, are preferably in isolated form, including DNA, RNA, DNA/RNA hybrids, and 
related molecules, and fragments thereof. Specifically contemplated are genomic DNA 
ribozymes, and antisense molecules, as well as nucleic acid molecules based on ax! 
alternative backbone or including alternative bases, whether derived from natural sources or 
15 synthesized. Embodiments of particular ceg polynucleotide and amino acid sequences 
include, but are not limited to, the sequences described in Tables I and II (e g SEQ ID 
NOS.-1-113, 114-226 and SEQ ID NOS: 227-339, 340-452, respectively). ' T he ceg 
polynucleotide and amino acid sequences were designated cfe which stands for CEG For 
Expression. 
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Biological samples of the 2CFE nucleic acid molecules (e.g., SEQ ID NOS: 227-331) 
were deposited on December 20, 2000 with the American Type Culture Collection 
(ATCC), 10801. University Blvd., Manassas, VA 201 10-2209. 

TABLE I 



CFE Designation 


SEQ. ID NO. 
(Nucleotide) 


SEQ. ID NO. 
(Polypeptide) 


POLARITY 


CFE1 


1 


114 


+ 


CFE 2 


2 


115 




CFE 3 


3 


116 




CFE 4 


4 


117 • 


• + 


CFE 5 


5 


118 




CFE 6 


6 


119 


+ 
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oltQ. ID NO. 
(Nucleotide) 


SEQ.IDNO. POLARITY 
(Polypeptide) 


CFE 7 


/ 


120 


_ 


CFE 8 


o 


121 


+ 


CFE 9 


Q 

y 


122 


+ 


CFF 10 


1 A 

1U 


123 


+ 


CFF 1 1 


1 1 

1 1 


124 • 


+ 


CFF 12 


1 9 

Iz 


125 


+ 


CFF n 




126 




CFE 14 


14 


127 


+ 


CFE 15 

wX X_/ X .J 


1 c 

Id 


128 - 






16 


129 




CFF 17 


17 


130 




PT7P 1 Q 


18 


131 


+ 


^rH Zl 


19 


132 




PFF? 0/1 
v^rxi Z4 


20 


133 




PFT? 9<; 


21 


134 




PFP 9/c 
LrE ZO - 


22 


135 




PFF? 97 


23 


136 


+ - 


PFF 9Q 
v^jTx_, Zo 


24 


137 




CFF 90 


25 


138 





PFF in 


26 


139 




PFP 1 1 


27 


140 




LrJci j 2 


28 


141 






29 


142 




PFF 14 


30 


143 


+ 


PFF ^ 


31 


144 


+ 


PFF 16 


32 


145 


+ 


PFF 37 


33 


146 


- 


PFF 38 


34 


147 


+ * 


PFF 3Q 


35 


148 




CFE 40 


36 


149 




CFE 41 


37 


150 




CFE 42 


38 


151 




CFE 43 . 


39 


152 




CFE 44 


40 


153 


+ 


CFE 45 


41 


154 




CFE 46 .. 42 


, 155 
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TABLE I 



V^fH* ISvalgllallOIl 


5>EQ, ID NO. 
(/Nucleotide) 


SEQ.IDNO. POLARITY 
(Polypeptide) 


PFF 47 


A O 

43 


156- J 


CFF 48 


44 


157 




PFF 40 


A C 

45. 


158 


+ 


CFF SO 


46 


159 


+ 


PFF 51 


47 


160 


+ 


CFF 59 


45 


161 




PFF 53 


49 


162 


+ 


PFF 54 


50 


163 


+ 


CW 55 


51 


164 


+ 


PFF 5 A 


52 


165 




PTTT7 ^7 


53 


166 


+ 


PFF 58 


54 


167 


4- 


PFF 50 


55 


168 




PFF £fi 


56 


169 


-f. 


PFF A1 


57 


170 


4- 

1 




58 


171 




prr; s->i 
L,r Jb oi 


59 


172 




pun /T/i 

Crli 64. 


60 


173 


-f 


Crb 65 


61 


174 


+ 


CrJb 66 


62 


175 


-f 


CrU- o/ 


63 


176 


+ 


r^TJT? 


64 


177 




PFF £Q 
v^rn oy 


65 


178 


- + 


PFF 7fl 


66 


. 179 




PFF 71 


67 


180 




PPP 70 


68 


181 




PPP 7*5 


69 


182 


+ 


PPT? 7/f 

^r±i /4 


70 


183 




CrJb /5 


71 


184 




CFE 76 


72 


.185 


• + 


CFE77 


73 


186 




CFE 78 


74 


187 


.+ • 


CFE 79 


75 


188 




CFE 80 


76 


189 




CFE 81 


77 


190 


+ 


CFE 82 


78 


191 
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TABLE I 



CP'S 1 , T^PClfTtl o4~i/"lT-t 

v ^- a - *-> x/cMgudiiun 


CPA TT\ TVT^fc 

MLQ. ID NO. 
(iiucieonde) 


SEQ.ro NO. POLARITY 
(Polypeptide) 


CFE 83 


70 

/y 


192 




CFE84 


CO 


193 


_ 


CFE 85 


CI 
ol 


194 




CFE 86 


SO 


195 


_ 


CFE 87 




196 




CFE 88 


24 


197 . 


_ 


CFE 89 


OJ 


198 




CFE 90 


fi£ 
oo 


199 




CFE 91 


C7 


200 




CFE 92 


QQ 
oo 


201 




CFE 91 


OO 

oy 


202 




CFF 94 


OA 

yu 


203 




CFF 95 

v^i. jo _7_/ 


01 

yi 


204 




CFF Q£ 


92 


205 




PFF 07 


93 


206 




ppp no 


94 


207 




PFF GO 
C yy 


95 


208 




CFF 100 

V_>1 JO I \J\J . 


O/T 

96 


209 




CFF 101 


0*7 

yy 


210 




CFF 10? 


y© 


211 




CFF 101 


oo 

yy 


212 




CFF 1 04 

\sl JO 1 V*T 


1 oo 

100 


213 


H- 


CFF 10S 

V-'X JO 1 Uj 


1 01 

1U1 


214 




CFE 1 06 




215 




CFF 107 


103 


216 




CFF 1052 


1 f\A 

104 


217 


+ 


CFF 10Q 

jo i \jy 




218 




CFE1 1 0 




219 




CFE 111 


■ 107 


220 




CFE 112 


108 


221 




CFE 113. 


109 


222 




CFE 114 


110 . 


223 




CFE 115 


Ill 


224 




CFE 116 


112 


225 




CFE 117 


113 


226 
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TABLE n 



CFE Designation 



2CFE 1 



2CFE2 



2CFE3 



SEQ. ID NO. 
(Nucleotide) 



2CFE4 



2CFE5 



2CFE6 



2CFE7 



2CFE8 
2CFE9 



2CFE 10 



2CFE11 



2CFE 12 



2CFE 13 



2CFE 14 



2CFE 15. 



2CFE 16 



2CFE 17 



2CFE 19 



2CFE21 
2CFE 24 



2CFE 25 
2CFE26 



SEQ. ID NO. 
(Polypeptide) 



FIGURE 



26 



27 



28 



29 



30 
31 



32 



33 
34 



_35_ 
36 



37 



38 



39 



40 



41 



42 
43 



44 
45 



46 



2CFE27 



2CFE 28 



2CFE29 



2CFE 30 



2CFE31 



2CFE 32 



2CFE 33 



47 



48 



49 



50 



51 



52 



53 



54 



2CFE35 
2CFE 36 



2CFE37 



2CFE 38 
2CFE39 



55 



56 
57 
58 



59 



60 
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CFE.Designation 

2CFE 40 
2CFE 41 
2CFE 42 
2CFE43 
2CFE 44 
2CFE45 
2CFE 46 
2 CFE 47 
2CFE48 
2CFE 49 
2CFE 50 
2CFE51 


SEQ. ID NO. 
(Nucleotide) 


SEQ. ID NO. 
(Polypeptide) 


FIGURE 

61 
62 
63 
64 
65 
66 

0/ 

68 
69 
70 

71 . 


2CFE 52 
2CFE 53 
2CFE 54 
2CFE 55 
2CFE 56 
2CFE57 
2CFE 58 

zCrb 60 
2CFE 61 
2CFE 62 






72 
73 
74 

75- 

76 

77 

78 
. 79 
80 
81 
o2 
83 
84 








85 


^v/rii 0/ 






86 


/LrH Oo 






87 
88 


^Crxs oy 






89 


o/^cr? an 
zCri} /U 


• 




90 


zCrE 71 






91 


2CFE 72 • 






92 


2CFE 75 






93 


2CFE76 






94 


2CFE 78 






95 


2CFE79 






96 


2CFE 80 






97 
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criL designation 


SEQ. m NO. 
(Nucleotide) 


| SEQ.ro NO. 
(Polypeptide) 


FIGURE 








98 


2 CFE 82 






99 


2CFE83 






100 








101 


^K^rc, oj 






102 


<£\->rn, oo 
2 CFE 87 






103 


2CFE 88 






1U4 

105 








106 


2CFE90 






107 


2CFE 91 
2CFE 92 






108 


2CFE 94 






109 
110 


2CFE 95 
2CFE 96 






111 


2CFE 97 
2CFE 99 
2CFE 101 






112 
113 
114 


2CFE 102 
• 2CFE103 
2CFE 104 






115 
116 
117 


2 CFE 105 
2CFE 106 
2CFE 107 
2CFE 108 
2CFE 109 




- 


118 
119 • 
120 
121 
122 


2CFE111 
2CFE 1 12 
2CFE113 
2CFE114 
2CFE115 






123 
124 
125 
126 
127 


2CFE116 






128 
129 


2CFE117 






130 
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a) Variants Nucleotide Sequences 

Tie present invention also provides nucleic acid molecules having a nucleotide sequence 
substantially identical or similar to the ceg sequences (SEQ ID NOS: 1-113, 227-331) 

disclosed herein. 

The present invention provides nucleotide sequences which are similar to SEQ ID 
NOS:l-113 and/or SEQ ID NOS:227-331. . The present invention provides nucleotide 
sequences which vary from SEQ ID NOS:M13 or 227-331 by a range of about 1% to 

about 70%, 



The present invention encompasses variations in polynucleotide sequences resulting from 
mutations and/or from transfer of genetic material from one cell to another (e.g., 
15 horizontal gene transfer or horizontal gene exchange). 

The present invention also provides for variants of the polynucleotide ceg sequences 
disclosed herein, including variants isolated from naturally-occurring sources those 
generated by recombinant DNA technology or other in vitro synthesis methodologies 
20 (e.g., PGR). The variant polynucleotide sequences of the invention encode polypeptides 
that exhibit the biological activity of naturally-occurring CEG polypeptides, such as 
activity required for bacterial cell viability. 

In general, for example, a variant of ceg polynucleotide sequences may encode a 
25 polypeptide that differs by one or more amino acid substitutions. The variant may have 
conservative changes, wherein a substituted amino acid has similar structural or chemical' 
properties, eg, replacement of leucine with isoleucine. 

A polynucleotide sequence can encode conservative amino acid substitutions without 
30 altering either the conformation or ^function of the polypeptide. Such changes include 
substituting any of isoleucine (I), valine (V), and leucine (L) for any other of these 
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hydrophobic amino acids; aspartic acid (D) for glutamic acid (E) and vice versa; 
glutamine (Q) for asparagine (N) and vice versa; and serine (S) for threonine (T) and vice 
versa. Other substitutions can also be considered conservative, depending on the 
environment of the particular amino acid and its role in the three-dimensional structure of 
the protein. For example, glycine (G) and alanine (A) can frequently be interchangeable, 
as can alanine (A) and valine (V). Methionine (M), which is relatively hydrophobic, can' 
frequently be interchanged with leucine and isoleucine, and sometimes with valine. 
Lysine (K) and arginine (R) are frequently interchangeable in locations in which the 
significant feature of the amino acid residue is its charge and the differing pK's of these 
two amino acid residues are not significant. Still other changes can be considered 
"conservative" in particular environments. 



A variant may also have nonconservative changes, eg, replacement of a glycine with a 
tryptophan. Other variations may also include amino acid deletions or insertions, or both. 
15 Guidance in determining which and how many amino acid residues may be substituted, 
inserted or deleted without abolishing biological or immunological activity may be found 
using computer programs well known in the art, for example, DNASTAR software. 

Another type of ceg sequence variant includes naturally-occurring allelic variants of ceg 
20 which share significant sirmlarity (e.g., between about 30- 99%) to the disclosed CEG 
polypeptide sequence. Allelic variants of the ceg sequences can encode conservative or 
non-conservative amino acid substitutions of the CEG polypeptide sequence herein 
described. 

25 An example of allelic variants of ceg are mutant alleles of ^polynucleotide sequences that 
encode a polypeptide having one or more changes in the polypeptide sequence, such as 
amino acid substitutions, deletions, insertions, frame shifts, or truncations. The mutant 
alleles of ceg may or may not encode a CEG polypeptide having the same biological 
functions as wild-type CEG proteins. 

30 
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Variations in the bacterial genomic sequences can also arise from transfer of genetic 
material to another bacterial cell. The transfer of gene sequences can occur intraspecies 
or interspecies. Gene transfer can occur between bacterial cells which are members of 
Ihe same or different populations. A population includes, but is not limited to, a serotype 
isolate, a clinical isolate, a mturally-occurring isolate, a strain, and a species. The 
transfer of genetic material can occur between cells within a population; for example 
transfer between serotype A to serotype A, or between S. pneumoniae and S. 
pneumoniae. The transfer of genetic material can occur between cells of different 
populations; for example, between serotype A to serotype B or S. pneumoniae and S. 



10 mutarts, 
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20 
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Gene transfer can give rise to mutant or polymorphic variant genes sequences. In rare 
cases, gene transfer introduces new gene sequences that confer a new phenotype, such as 
antibiotic resistance. The transfer of genetic material includes transfer of large regions of 
genomic sequences which include partial gene sequences, whole single gene sequences, 
or multiple gene sequences. This mode of transfer can give rise to replacement of native' 
whole gene sequences or introduction of new sequences in the recipient cell. This mode 
of transfer gives rise to mosaic gene sequences in the recipient cell. 

The variation of genomic sequences resulting from gene transfer can be examined using 
molecular techniques, including: mulhiocus enzyme electrophoresis (Selander. R. K., et 
al., 1986 Appl. Environ. Microbiol. 51:837-884); and restriction endonuclease cleavage 
electrophoretic profiling (Coffey, T. J., et al., 1991 Mol. Microbio. 5:2255-2260); pulse- 
field gel electrophoresis fingerprinting (Bygraves, J. A. and Maiden, M. C. J. 1992 J. 
Gen. Microbiol 138:523-531); and ribotyping (Stull, T. L., et al., 1988 J. Infect. Dis. 
157:280-286). The degree of variation can vary greatly, and ranges from little or no 
variation as exemplified by gene sequences of E coli (Caugant, d. A. 3 et al., 1981 
Genetics 98:467-490; Whittam, T. S., et al., 1983 Mol. Biol. Evol. 1:67-83; Souza,' V., et 
al., 1992 Proc. Natl. Acad. Sci. USA 89:8389-8393) and Salmonella (Selander, R.' K.,' et 
al., 1990 Infect. Immun. ttaiGl-ms- Selander, R.K. and Smith, K H. 1990 Rev. Med. 
Microbiol. 1:219-228; Smith, J. M., et al., 1993 Proc. Natl. Acad Sci. USA 90:4384- 
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4388), to extensive gene transfer in Neisseria gonorrhoeae (Smith, J. M., et al., 1993 
Proc. Natl. Acad. Sci. USA 90:4384-4388). 

Gene transfer can be examined between various isolates of a particular microbial species 
5 which are antibiotic-sensitive or antibiotic-resistent (Coffey, T. J., et al., 1991 Molec 
Microbiol. 5:2255-2260). Molecular biology techniques can be utilized to study the 
degree of transfer between populations, such as, for example, the degree of gene transfer 
between serotypes, isolates, strains/or species . The degree of transfer can be examined 
by comparing, for example, the penicillin binding proteins and numerous different loci 
1 0 which encode metabolic enzymes or capsular biosynthesis enzymes. 

For example, intra-species, inter-serotype, gene transfer is possible (Coffey, T. J., et al., 
1991 supra). Additionally, intraspecies gene transfer in S. pneumoniae (Coffey, T. J., et 
al., 1998 Mol. Microbiol. 27:73-83), Vibrio cholerae (Bik, E. M., et al., 1995 EMBO J. 
15 14:209-216), and Haemophilus influenzae (Kroll, J. S. and Moxon, E. R. 1990 J. 
Bacteriol. 172: 1374-1379) are possible. 

Interspecies gene transfer is also possible (Dowson, C. G., et al., 1989 Proc. Natl. Acad 
Sci. USA 86:8842-8846; Laibl, G., et al., 1991 Mol Microbiol. 5:1993-2002; Bourgoin, 
20 F.,etal., 1999 Gene 233:151-161). 

Variant gene sequences arising from gene transfer can be continually generated in 
transformable bacteria (e.g., transformation competent), such as S. pneumoniae. For 
example, the worldwide spread of varying degrees of antibiotic resistance has. been 

25 documented and reviewed (Dowson, C. G., et al., 1994 Trends Microbiol 2:361-366; 
Spratt, B. G. in Bacterial Cell Wall, eds Ghuysen J-M. and Hakenbeck, R. 1994 pp. 517- 
534; and reviewed in Maiden, M. C. J. 1998 Clinic. Infect. Dis. 27 (Supplement 1) S12- 
S20). For example, variant gene sequence arising from gene transfer can be tracked 
using a marker gene such as the gene which encodes the penicillin binding protein 

30 (Barcus, V. A., et al., 1995 FEMS Microbiol. Lett. 126:299-303). At the nucleotide level, ' 
gene sequences encoding the penicillin binding proteins in susceptible and resistant' 



32 



10 



WO 01/49721 

PCT/USOO/35604 

strains differ by about 14% to 23% (Hakenbeck, R. 1995 Biochem. Pharmacol. 50:1121- 
1 127; Spratt, B. G. in Bacterial Cell Wall, eds Ghuysen J-M. and Hakenbeck, R. 1 994 pp. 
517-534; Spratt, B. G., et al., 1991 Neisseria meningitidis and Streptococcus pneumoniae 
eds. Camisi, J., et al., pp. 73-83; Coffey, T. J., et al., 1995 Micro. Drug Resist. 1:29-34). 

The ceg nucleotide sequences can be isolated from various species of Streptococcus 
including Streptococcus pneumoniae. Additionally, the ceg sequences can be isolated from 
other Steptococcal species, including S. mutans, S. pyogenes, and S. thermophila, The ceg 
polynucleotide sequences can also be isolated from strains of other bacterial genera 
including, but not limited to, Streptococcus, Escherichia, Bacillus, Pseudomonas, 
Yersinia, Salmonella, sad Haemophilus. 



The present invention additionally provides isolated codon-usage variants that differ from 
the disclosed ceg nucleotide sequences, yet do not alter the predicted CEG polypeptide 
15 sequence or function. The codon-usage variants may be generated by recombinant DNA 
technology. Codons may be selected to optimize the level of production of the ceg 
transcript or CEG polypeptide in a particular prokaryotic or eukaryotic expression host, 
in accordance with the frequency of codon utilized by the host cell. Alternative reasons 
for altering the nucleotide sequence encoding a CEG polypeptide include the production 
20 of RNA transcripts having more desirable properties, such as an extended half-life or 
increased stability. A multitude of variant ceg nucleotide sequences that encode the 
respective CEG polypeptide may be isolated, as a result of the degeneracy of the genetic 
code. Accordingly, the present invention contemplates selecting every possible triplet 
codon to generate every possible combination of nucleotide sequences that encode the 
25 disclosed CEG polypeptides. This particular embodiment provides isolated nucleotide 
sequences that vary from the sequences as described in SEQ ID NOs.: 1-113 or 227-331, 
such that each variant nucleotide sequence encodes a polypeptide having sequence 
identity with the amino acid sequences, as described in SEQ ID NOs.:l 14-226 or 332- 
436, respectively. 

30 
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b) Complementary Sequences 



The present invention includes polynucleotide sequences that are complementary to the 
5 sequences disclosed herein. The term "complementary" as used herein refers to the 
capacity of purine and/or pyrimidine nucleotides to associate through hydrogen bonding 
to form double stranded nucleic acid molecules. The following base pairs are related by 
complementarity: guanine and cytosine; adenine and thymine; and adenine and uracil. 
Complementary applies to all base pairs comprising at least two single-stranded nucleic 
10 acid molecules. 



c) Sequences Capable of Hybridizing 

• Another embodiment provides nucleic acid molecules that will hybridize to ceg 
15 sequences under hybridization conditions. It is readily apparent to one skilled in the art 
that the stringency of the hybridization condition selected will depend upon the 
characteristics of the nucleic acid molecule to be hybridized, such as, the length, the 
degree of complementarity (e.g., exact or non-exact complementarity), the percent A/T 
content, and the objective of the hybridization experiment. 

20 

The hybridization procedure may by performed in low stringency hybridization 
conditions. Low stringency hybridization conditions will permit hybridization between 
two nucleic acid molecules that differ from exact complementarity by about 25% to 70%. ' 
Hybridization under standard high stringency conditions will occur between two 
25 complementary nucleic , acid molecules (e.g., 100% exact complementarity) or two 
complementary nucleic acid molecules that differ from exact complementarity by about 
1% to about 70%. 



30 



The high stringency hybridization conditions that disfavor non-homologous base pairing 
are well known in the art. Typically, high stringency hybridization conditions, includes 
but is not limited to, hybridizing at 50 °C to 65 °C in 5X SSPE, and washing at 50 °C to 
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65 °C in 0.5X SSPE. Typically, low stringency conditions, includes but is not limited to, 
hybridizing at 35 °C to 37 °C in 5X SSPE and 40% to 45% formamide and washing at 42 
°C in 1-2X SSPE. The conditions and formulas for high stringency hybridization 
methods are well known in the art and can be readily obtained in Molecular Cloning; A 
Laboratory Manual (2 nd edition, Sambrook, Fritch, and Maniatis 1989, Cold Spring 
Harbor Press) or in Short Protocols in Molecular Biology (Ausubel, F. M., et al., 1989, 
John Wiley & Sons).- 

d) Fragments of ceg Sequences 

The invention further provides nucleic acid molecules having fragments of the ceg 
sequences, such as a portion of the ceg sequence (e.g., SEQ ID NOS:l-113, 227-331) 
disclosed herein. The size of the fragment will be determined by its intended use. For 
example, the length of the fragment to be used as a nucleic acid probe or PCR primer is 
chosen to obtain a relatively small number of false positives during probing or priming. 
Alternatively, a fragment of the ceg sequence may be used to construct a recombinant fusion 
gene having a ceg sequence fused to a non-ceg sequence. 

The nucleic acid molecules, fragments thereof, and probes and primers of the present 
invention are useful for a variety of molecular biology techniques including, for example, 
hybridization screens of libraries, or detection and quantification of mRNA transcripts as 
a means for analysis of gene transcription and/or expression. Preferably, the probes and 
primers are DNA. A probe or primer length of at least 15 base pairs is suggested by 
theoretical and practical considerations (Wallace, B. and Miyada,. G. 1987 
25 "Oligonucleotide Probes for the Screening of Recombinant DNA Libraries" in: Methods ■ 
in Enzymology, 152:432-442, Academic Press). Other lengths of fragments, probes, or 
primers are possible and routine to determine. 

• The probes and primers of this invention can be prepared by methods well known to 
30 those skilled in the art (Sambrook, ef al. supra). In a preferred embodiment the probes 
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and primers are synthesized by chemical synthesis methods (ed: Gait, M. J. 1984 
Oligonucleotide Synthesis, IRL Press, Oxford, England). 

One embodiment of the present invention provides nucleic acid primers that are 
5 complementary to ceg sequences, which allow the specific amplification of nucleic acid 
molecules of the invention or of any specific parts thereof. Another embodiment 
provides nucleic acid probes that are complementary for selectively or specifically 
hybridizing to the ceg sequences or to any part thereof. 

10 e) Derivative Nucleic Acid Molecules 

Hie nucleic acid molecules of the invention include peptide nucleic acids (PNAs), or 
derivative molecules such as phosphorothioate, phosphotriester, phosphoramidate, and 
methylphosphonate, that specifically bind to single-stranded DNA or RNA in a base'pair- 
15 dependent manner (Zamecnik, P. C, et al, 1978 Proc. Natl. Acad. Sci. 75:280284; 
Goodchild, P. C, et al., 1986 Proc. Natl. Acad. Sci. 83:4143-4146). 

PNA molecules comprise a nucleic acid oligomer to which an amino acid residue, such as 
lysine, and an amino group have been added. These small molecules, also designated 
20 anti-gene agents, stop transcript elongation by binding to their complementary (template) 
strand of nucleic acid (Nielsen, P. E., et al., 1993 Anticancer Drug Des 8:53-63). For 
example, reviews of methods for synthesis of DNA, RNA, and their analogues can be 
found in: Oligonucleotides and Analogues, eds. F. Eckstein, 1991, IRL Press, New York; 
Oligonucleotide Synthesis, ed. M. J. Gait,' 1984, IRL Press, Oxford, England 
25 Additionally, methods for antisense RNA technology are described in U. S. patents 
5,1 94,428 and 5,1 10,802. A skilled artisan can readily obtain these classes of nucleic acid 
molecules using the herein described ^polynucleotide sequences, see for example 
Innovative and Perspectives in Solid Phase Synthesis (1992) Egholm, et al. pp 325-328 or 
U. S. Patent No. 5,539,082. 
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f) RNA Molecules 



The present invention provides RNA molecules that encode the predicted ceg gene 
5 products. In particular, the RNA molecules of the invention may be isolated full-length 
or partial mRNA molecules or RNA oligomers that encode CEG gene products. The 
RNA molecules of the invention include the nucleotide sequences encoding all or 
portions of CEGs. 

10 The RNA molecules of the invention also include antisense RNA molecules, peptide 
nucleic acids. (PNAs), or non-nucleic acid molecules such as phosphorothioate 
derivatives, that specifically bind to the sense strand of DNA or RNA in a base pair- 
dependent manner. A skilled artisan can readily obtain these classes of nucleic acid 
molecules using the herein described ceg sequences. 

15 

g) Labeled Nucleic Acid Molecules 

The nucleic acid molecules having ceg sequences can be labeled with a detectable 
marker. Examples of a detectable marker include, but are not limited to, a radioisotope a 
20 'fluorescent compound, a bioluminescent compound, a cherrulurninescent compound,' a 
metal chelator or an enzyme. Technologies for generating labeled DNA and RNA probes 
are well known in the art (See e.g. Sambrook et al., supra). 



25 



2 > RECOMBINANT NUCLFJr AnrDMOLFOTT FR 



Also provided are recombinant nucleic acid molecules, such as recombinant DNA molecules 
(rDNAs) that comprise ceg sequences or fragments thereof. As used herein, a recombinant 
DNA molecule is a DNA molecule that has been subjected to molecular manipulation in vitro. 
Methods for generating rDNA molecules are well known in the art, for example, see Sambrook 
30 et al., Molecular Cloning (1 989), supra. 
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The nucleic acid molecules of the invention may be recombinant molecules each 
comprising the sequence, or portions thereof, of a ceg sequence linked to a non-ce* 
5 sequence. For example, the ceg sequence may be fused operatively to a vector to 
generate a recombinant molecule. The term vector includes, but is not limited to, 
plasmids, cosmids, and phagemids. A preferred vector includes an autonomously 
replicating vector comprising a replicon that directs the replication of the rDNA within the 
appropriate host cell. The preferred vectors can also include an expression control 
10 element, such as a promoter sequence, which enables transcription of the inserted ceg 
sequences and can be used for regulating the expression (e.g., transcription and/or 
translation) of an operably linked ceg sequence in an appropriate host cell such as 
Escherichia coll Expression control elements are known in the art and include, but are not 
limited to, inducible promoters, constitutive promoters, secretion signals, enhancers, 
15 tianscription temmaton. and oflw Other expression 

control elements that are involved in translation are known in the art, and include the Shine- 
Dalgamo sequence, and initiation and termination codons. The preferred vector also 
includes at least one selectable marker gene that encodes a gene product that confers drug . 
resistance such as resistance to ampicillin or tetracyline. The vector also comprises 
20 multiple endonuclease restriction sites that enable convenient insertion of .exogenous 
DNA sequences. ' 

The preferred vectors for generating ceg transcripts and/or the encoded CEG polypeptides 
are expression vectors which are compatible with prokaryotic host cells. Prokaryotic cell ' 
25 expression vectors are well lcnown in the art and are available from several commercial 
sources. For example, a pET vectors (e.g., pET-21, Novagen Corp.) may be used to 
express CEG polypeptides in bacterial host cells. 
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b) Recombinant Vectors for Integration 

The present invention provides recombinant vectors that may be used to integrate 
5 exogenously provided sequences into the genome of a host cell. The recombinant 
integration vectors of the present invention include a gene that encodes a selectable 
marker and ceg sequences; or fragments thereof. The integration vectors are used to 
integrate the ceg sequence into a target gene sequence that resides within the bacterial 
host genome (e.g., endogenous sequence), thereby disrupting the function of the target 
10 gene sequence within the bacterial cells. These integration vectors may be used in a gene 
disruption assay to screen candidate ceg nucleotide sequences, in order to identify the 
candidate sequences that encode a gene product that is required for bacterial cell viability. 

Accordingly, these recombinant integration vectors include candidate ceg sequences that 
15 will be screened to determine if the candidate ceg sequences encode a gene product that 
is required for cell viability. The candidate ceg sequence that is included as part of the 
recombinant integration vector is the "exogenous" ceg sequence that is employed as the 
"disrupting" sequence in a gene disruption assay. The ceg sequence that resides within 
the host genome is the "endogenous" or "target" ceg sequence. 

20 

The integration event rarely occurs, for example, by non-homologous recombination in 
which a recombinant vector, that includes the exogenous ceg sequence, inserts the 
exogenous ceg sequence into a random location within the host genome. In a more 
preferred embodiment, the integration event inserts the exogenous ceg sequence into a 
25 specific target site within the host genome. The targeted integration event can involve 
homologous recombination in which the integration vector, that includes the exogenous 
ceg sequence, inserts the exogenous ceg sequence into its homologous target ceg 
sequence that resides within the host's genome (e.g., the endogenous ceg sequence) 
(Figure 1). Further, the exogenous ceg sequence can be used as a disrupting sequence 
30 whereby the homologous recombination event integrates the exogenous ceg sequence 
into the endogenous target ceg sequence resulting in disruption of the function of the 
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endogenous ceg sequence. For example, disrupting the function of the endogenous ceg 
sequence may result in the loss of bacterial cell viability. 

An example of a recombinant vector that can be used as an integration vector in S. 
5 pneumoniae is the pEVP-3 vector (Jean-Pierre Claverys, et al. 1995 Gene 164: 123-128). 
The pEVP-3 vector integrates an exogenous sequence by homologous recombination 
involving a Campbell-type event (S. Adhya and A. Campbell 1970 J. Mol. Biol. 50:481- 
490). The pEVP-3 vector includes a replicon that functions only in gram-negative 
bacteria, such as & coll Therefore, the pEVP-3 vector cannot replicate in S. 
10 pneumoniae. This vector also contains multiple cloning sites, and confers resistance to 
chloramphenicol in both a gram-negative and gram-positive bacteria, such as S. 
pneumoniae. 



15 



20 
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c) Fusion Gene Sequences 

A fusion ceg gene is another example of a recombinant molecule of the invention. A fusion 
gene includes a ceg sequence operatively fused (e.g., linked) to a non-ceg sequence such as, 
for example, a tag sequence to facilitate isolation and/or purification of the expressed 
CEG gene product (Kroll, D.J., et al, 1993 DNA Cell Biol 12:441-53). 

Alternatively, a recombinant fusion molecule has a ceg sequence of the invention fused to 
a ceg sequence isolated from a different microbial source. For example, the disclosed ceg 
sequences isolated from S. pneumoniae can be fused to a ceg sequence isolated from a 
different bacterial species. 

3.) CEG PROTEI NS AND POLYP EPTIDE MOTJTTTf .vx 



The invention additionally provides CEG proteins and peptide fragments thereof that are 
isolated or substantially purified. Embodiments of particular CEG amino acid sequences 
0 are disclosed in Tables I and II (SEQ ID NOS:l 14-226 and SEQ ID NOS:332-436, 
respectively). 
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The present invention also includes polypeptides having sequence variations from the 
predicted CEG polypeptide sequences disclosed herein, including mutant variants 
conservative substitution variants, and similar CEG polypeptides from other prokaryotic' 
organisms. For convenience, such proteins are referred to herein as "CEG proteins", 
"CEG polypeptides", or "proteins of the invention". 

As used herein, CEG protein refers to a polypeptide having amino acid sequence identity or 
similarity to any one of the predicted amino acid sequences, as provided in SEQ ID NO.: 
114-226 or 332-436. The variant CEG polypeptides can be allelic forms of CEG such as 
mutant forms of CEG polypeptides. The present invention also provides conservative 
substitution-mutants of the CEG proteins that maintain functional activity of wild-type CEG 
(e.g., the CEG polypeptide is required for bacterial cell viability). 

15 The CEG protein may be isolated from any source whether natural, synthetic, semi- 
synthetic, or recombinant. As used herein, "natural" refers to a polypeptide which is 
found in nature. Accordingly, the CEG proteins niay be isolated from a prokaryotic 
organism, such as a bacterial strain including, but not limited to, Streptococcus 
Escherichia, Bacillus, Pseudomonas, Yersinia, Salmonella, and Streptomyces. The CEG 

20 proteins of the invention, and fragments thereof, can also be generated by recombinant 
methods or chemical synthesis methods. 

The CEG polypeptides of the invention are essential for the viability of a bacterial cell. 
Further, the CEG polypeptides can exhibit at least any one of the following functions: a 
pantothenate kinase, a Holliday Junction branch migration protein, a single stranded 
DNA binding protein, a phosphoglucosamine mutase, an acetyltransferase, an 
uridylyltrahsferase, a malonyl CoenzymeA.ACP transcylase, a 3-oxoacyl-ACP synthase 
H a 3-oxoacyl-ACP reductase, a phosphomemylpyrimidine (HMP-P) kinase a GTP 
binding protein, a ATP binding protein, or a 4-aminoimidazole carboxylase. Putative 
functions can include, but are not limited to, sugar transferase, techoic acid biosynthesis, . 
ribosome recycling factor, response regulator, nicotinate phosphoribosyltransferase,' 
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nitropropane dioxygenase, (3R)-hydroxymyristol acyl carrier protein dehydrase, sugar 
dehydrogenase, murein biosynthesis, cobalimin biosynthesis, ABC transporter, tRNA 
modification enzyme, arylsulfatase, 16S processing enzyme, tRNA methyl transferase, 
elongation factor P, signal recognition particle, protein export, undecaprenol kinase, SRP 
docking domain, diacyl glycerol kinase, dihydopicilinate reductase, HU-DNA binding 
protein, thiamine biosynthase, GreA transcription elongation factor, dTDP-L-rhamnose 
synthase, ATP-binding motif, ribose-5i)-3-epirnerase-like activity, GTP 
pyrophosphokinase, acetyl-CoA carboxylase, O-sialoglycoprotein endopeptidase, 
glucosamine-fructose-6-phosphase aminotransferase, Strpn adhesion-associated ABC- 
permease, GTP pyrophosphokinase RelA, IMP dehydrogenase, DNA gyrase subunit B, 
acetyl-CoA carboxylase subunit AccD, phosphoglycerol kinase, acetyl-CoA carboxylase 
carbonyl transferase,, phosphopanthetheine adenylyltransferase, oligopeptide transport 
permease subunit, translocation protein, perM permease, DNA pol HI gamma and tau 
subunits, DNA pol HI delta subunit, signal peptidase I, acetyl-coA carboxylase biotin 
carboxyl carrier protein, protein chain release factor-1, replicative DNA helicase, 
topoisomerase, pentapeptide-transferase, elongation factor G, spore coat polysaccharide 
biosynthesis protein C, protein release factor B, DNA polymerase HI alpha subunit, 
phosphoprotein phosphatase, chaparonin, UDP-N-acetylmuramoylalanyl-D-glutamate-2, , 
6-diaminopimelate ligase, techuronic acid biosynthesis, UDP-glucose lipid carrier 
transferase, transcription termination factor, chromosome segregation factor, amino acid 
biosynthesis, HMG-CoA reductase, hypoxanthine-guanine phosphoribosyltransferase. 

a) MODULATO RS OF CEft PQLYPEPTTT)F.S 

The invention provides compounds that modulate (e.g., activate or inhibit) the function of 
a CEG polypeptide. Such compounds can provide lead-compounds for developing drugs 
for diagnosing and/or treating conditions associated with bacterial infections. The 
modulator is a" compound that may alter the function of the CEG polypeptide, such as 
activating or inhibiting the function of a CEG polypeptide. For example, the compound 
can act as agonist, antagonist, partial agonist, partial antagonist, cytotoxic agents, 
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inhibitors of cell proliferation, and cell proliferation-promoting agents. The activity of 
the compound may be known, unknown or partially known. 

Suitable ligands include, but are not limited to, diazalactones, ^protected amino acid, 
5 azabicyclodiene, and alkaloids. 



• An example of a diazalactone is: 




An example of a ^-protected amino acid is: 




A 

NH 

V 

O- 




An example of an azabicyclodiene is; 
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B) METHODS FOR MAKING THE rvr. prqtrtns Aivn Pnr ™ ?Tmr ~ 

Recombinant methods are preferred if a high yield is desired. Recombinant methods 
involve expressing the cloned gene in a suitable host cell. For example, a host cell is 
introduced with an expression vector having the CEG sequence, then the host cell is 
cultured under conditions that permit in v/vo production of the CEG protein The 
recombinant vector can integrate the CEG sequence into the host genome. Alternatively 
the CEG sequence can be maintained extra-chromosomally, as part of an autonomously 
replicating vector. 



15 



*• HOST-VECTOR SYSTEMS 

The invention further provides a host-vector system comprising the vector, plasmid 
Phagemid, or cosmid comprising a ceg nucleotide sequence, or a fragment thereof 
introduced into a suitable host cell. The host-vector system can be used to produce the' 
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CEG polypeptides encoded by the ceg nucleotide sequences. The host cell can be 
prokaryotic or eukaryotic. Examples of suitable prokaryotic host cells include bacteria 
strains from genera such as Escherichia, Bacillus, Pseudomonas, Streptococcus, and 
Streptomyces. Examples of suitable eukaryotic host cells include a yeast cell, a plant cell, 
or an animal cell, such as a mammalian cell. A preferred embodiment provides a host-' 
vector system comprising the pET21 vector having a ceg sequence introduced into an E 
coli A.DE3 lysogen which is useful, for example for the production of the CEG protein, 
herein designated CFE polypeptides and CFE proteins. 

Introduction of the rDNA molecules of the present invention into an appropriate cell host is 
accomplished by well known methods that typically depend on the type of vector used and 
host system employed. For example, transformation of prokaryotic host cells by 
electroporation and salt treatment methods are typically employed, see for example, Cohen 
etal., 1972 Proc Acad Sd USA 69:2110; Maniatis, T., etal.,1989 Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY. 
Transformation of vertebrate cells with vectors containing rDNAs, electroporatibn, cationic 
lipid or salt treatment methods are typically employed, see, for example, Graham et al, 
1973 Virol 52:456; Wigleretal., 1979 Proc Natl Acad Sci USA 76:1373-76. 

20 Successfully transformed cells, i.e., cells that contain a rDNA molecule of the present 
invention, can be identified by well known techniques. For example, cells resulting from 
the introduction of a rDNA of the present invention can be selected and cloned to produce 
single colonies. Cells from those colonies can be harvested, lysed and their DNA content 
examined for the presence of the rDNA using a method such as that described by Southern, 
25 JMol Biol (1975) 98:503, or Berent et al., Biotech (1985) 3:208, or the proteins produced 
from the cell assayed via a biochemical assay or immunological method. 

Procaryotes are generally used as host cells for cloning and producing the products of " 
exogenous DNA sequences. For example, the Escherichia coli K12 BL21 (XDE3) 
30 (Novagen) is particularly useful for expression of foreign proteins. Other strains of E. 
coli, and bacilli such as Bacillus subtilis, Enterobacteriaceae such as Salmonella 
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typhimurium or Serratta marcescans, various Pseudomonas, Streptococcus and 
Streptomyces species may also be employed as host cells in cloning and expressing the 

recombinant proteins of this invention. 

In general terms, the production of recombinant CEG proteins may involve using a 
host/vector system, or other methods may be used. The host/vector system may employ the 
following steps. 

A nucleic acid molecule is obtained that encodes a CEG protein or a fragment thereof, such 
10 as any one ofthe polynucleotides disclosed in SEQ ID NOs.: 1-113 or227-331. The CEG- 
encoding nucleic acid molecule is preferably inserted into an expression vector in operable 
linkage with suitable expression control sequences, to generate an expression vector 
including the CEG-encoding sequence. The expression vector is introduced into a suitable 
host, by standard transformation methods, and the resulting transformed host is cultured 
15 under conditions that allow the production ofthe CEG protein,For example, if expression 
of the CEG gene is under the control of an inducible promoter, then suitable growth 
conditions would include the appropriate inducer. The CEG protein (e.g., designated a 
CFE polypeptide or protein), so produced, is isolated from the growth medium or directly 
from the cells; recovery and purification ofthe protein may not be necessary in some 
20 instances where some impurities may be tolerated. A skilled artisan can readily adapt an 
appropriate host/expression system known in the art for use with CEG-encoding sequences 
to produce a CEG protein. (Cohen, et al., supra; Maniatis et al., supra). 

Host cells harboring the nucleic acids disclosed herein are also provided by the present 
5 invention. A preferred host is E. coli strain BL21(*DE3) transfected or transformed with 
a vector comprising a nucleic acid ofthe present invention. The invention also provides a 
host cell capable of expressing the ceg sequences described herein. The preferred host 
cell is any strain of £ coli that can accommodate high level expression of an exogenously 
introduced gene. 
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The proteins of the present invention can also be made by chemical synthesis The 
pnncrples of solid phase chemical synthesis of polypeptides are well known in the art and 
may be found in general texts relating to this area (Dugas, H. and Penney, C 1981 
Bvorganic Chemistry, pp 54-92, Springer-Verlag, New York). CEG polypeptides may 
be synthesized by solid-phase methodology utilizing an Applied Biosystems 430A 
peptide synthesizer (Applied Biosystems, Foster City, Calif.) and synthesis cycles 
supphed by Applied Biosystems. Protected amino acids, such as t-butoxycarbonyl- 
protected amino acids, and other reagents are commercially available from many 
chemical supply houses. 



10 



The polypeptides of the invention exhibit properties of a CEG protein, such as for 
example, the ability to elicit the generation of antibodies that specifically bind an epitope 
associated with CEG polypeptides. Accordingly, the CEG polypeptide, or any 
oligopeptide thereof, is capable of inducing a specific immune response in appropriate 
15 animals or cells and binding with specific antibodies. 




20 The mvention further provides antibodies (e.g., polyclonal, monoclonal, chimeric 
humanized, and human antibodies) that bind a CEG polypeptide. The most preferred 
antibodies will selectively bind a CEG polypeptide and will not bind (or will bind weakly) a 
non-CEG polypeptide. Antibodies that are particularly contemplated include monoclonal 
and polyclonal antibodies, as well as fragments thereof (e.g., recombinant proteins) which 
25 mclude the antigen binding domain and/or one or more complement determining regions of 
these antibodies. These antibodies can be from any source, for example, rabbit, sheep, rat, 
dog, cat pig, horse, mouse, and human. 

The invention encompasses antibody fragments that specifically recognize a CEG 
30 polypeptide. As used herein, an antibody fragment is defined as at least a portion of the 
vanable region of the immunoglobulin molecule that binds to its target, i.e., the antigen 
binding region. Some of the constant region of the immunoglobulin may be included. 
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As will be understood by those skilled in the art, the .regions or epitopes of a CEG 
polypeptide to which an antibody is directed may vary with the intended application For 
example, antibodies intended for use in an immunoassay for the detection of membrane- 
5 bound CEG proteins on viable bacterial cells should be directed to an accessible epitope 
on membrane r bound CEG proteins. Antibodies that recognize other epitopes may be 
useful for the identification of CEG protein within damaged or dying cells, for the 
detection of secreted CEG protein or fragments thereof. 

10 Various methods for the preparation of antibodies are well known in the art. For example 
antibodies may be prepared by immunizing a suitable mammalian host using a CEG protein,' 
peptide, orfragment,misolatedorirmnunoconjugatedform(Har^ 1989 Antibodies Cold 
Spring Harbor Press, NY). In addition, fusion proteins comprising CEG polypeptides may 
also be used, such as a CEG protein/GST-fusion protein. Cells expressing or overexposing 

15 a CEG polypeptide may also be used for immunizations. Similarly, any cell engineered to 
expressCEGproteinmaybeused. ^strategy may result inthe production of monoclonal 
antibodies with enhanced capacities for recognizing endogenous CEG protein. 

The present invention contemplates chimeric antibodies that comprise a human and non- 
human immunoglobin portion. The antigen combining region (variable region) of a 
chimeric antibody can be derived from a prokaryotic source (e.g., bacteria) and the 
constant region of the chimeric antibody which confers biological effector function to the 
immunoglobulin can be derived from a eukaryotic source (e.g., human). The chimeric 
antibody should have the antigen binding specificity of the prokaryotic antibody 
molecule and the effector function conferred by the eukaryotic antibody molecule. 

In one example, the procedure used to produce chimeric antibodies can involve the 

following steps: 

a) Identifying and cloning the correct immunoglobin gene segment encoding the 
antagen binding portion of the antibody molecule. This gene segment is known as 
the VDJ, variable, diversity and joining regions, for heavy chains or VJ, variable, 
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joining regions for light chains or simply as the V or variable region. This gene 
regions may be in either the cDNA or genomic form; 

b) Cloning the gene segments encoding the constant region or desired part thereof; 

c) Ligating the variable region with the constant region so that the complete chimeric 
5 antibody is encoded in a form that can be transcribed and translated; 

d) Ligating this construct into a vector containing a selectable marker and gene control 
regions such as promoters, enhancers and poly(A) addition signals; 

e) Amplifying this construct in bacteria; 

f) Introducing this DNA into eukaryotic cells (transfection) most often mammalian 
10 lymphocytes; 

g) Selecting for cells expressing the selectable marker; 

h) Screening for cells expressing the desired chimeric antibody; and 

k) Testing the antibody for appropriate binding specificity and effector functions. 
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Chimeric antibodies of several distinct antigen binding specificities have been produced 
by protocols well known in the art, including anti-TNP antibodies (Boulianne et al., 1984 
Nature 312:643); and anti-tumor antigen antibodies (Sahagan et al., 1986 J. Immunol. 
137:1066). Likewise, several different effector functions have been achieved by linking 
new sequences to those encoding the antigen binding region. Examples of these include 
enzymes (Neuberger et al., 1984 Nature 312:604); immunoglobulin constant regions 
from another species and constant regions of another immunoglobulin chain (Sharon et 
al,l9$4 Nature 309:364; Tan et al., 1985 J. Immunol. 135:3565-3567). Additionally, 
procedures for modifying antibody molecules and for producing chimeric antibody 
molecules using homologous recombination to target gene modification have been 
described (Fell et al., 1989 Proc. Natl. Acad. Sci. USA 86:8507-851 1). 

The predicted amino acid sequence of a CEG protein may be used to select specific regions 
of the CEG protein for generating antibodies. For example, hydrophobicity and 
hydrophilicity analyses of a CEG polypeptide may be used to identify hydrophobic and 
hydrophilic regions in the CEG protein. Regions of the CEG protein that show 
immunogenic structure, as well as other regions and domains, can readily be identified using 
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various other methods known in the art, such as Chou-Fasman, Garnier-Robson , Kyte- 
Doolittle, Eisenberg, Karplus-Schult or Jameson-Wolf analysis. Fragments that include the 
immunogenic regions are particularly suited for generating specific classes of antibodies. 

5 Methods for preparing a protein for use as an immunogen and for preparing immunogenic 
conjugates of a protein with a carrier such as BSA, KLH, or other carrier proteins are well 
known in the art. In some circumstances, direct conjugation using, for example, 
carbodiirnide reagents may be used; in other instances linking reagents such as those 
supplied by Pierce Chemical Co., Rockford, IL, may be effective. Administration of a CEG 
10 immunogen is conducted generally by injection over a suitable time period and with use of a 
suitable adjuvant, as is generally understood in the art. During the immunization schedule, 
titers of antibodies can be taken to determine adequacy of polyclonal antibody formation. 

While the polyclonal antisera produced in this way may be satisfactory for some 
applications, for pharmaceutical compositions, monoclonal antibody preparations are 
preferred. Immortalized cell lines which secrete a desired monoclonal antibody may be 
prepared using the standard method of Kohler and Milstein {Nature 256: 495-497) or other 
techniques as described in Monoclonal Antibodies; A Manual of Techniques, CRC press, 
Lie, Boca Raton, Fla.(1987) ed. Zola. The immortalized cell lines secreting the desired 
antibodies are screened by immunoassay in which the antigen is the CEG polypeptide 
having binding activity, or a fragment thereof. When the appropriate immortalized cell 
culture secreting the desired antibody is identified, the cells can be cultured either in vitro or 
by production in ascites fluid. 

25 The desired monoclonal antibodies are then recovered from the culture supernatant or from 
the ascites supernatant. Fragments of the monoclonal antibodies of the invention or the 
polyclonal antisera (e.g., Fab, ?(zb% Fv fragments, fusion proteins) which contain the 
immunologically significant portion (i.e., a portion that recognizes and binds a CEG protein) 
can be used as antagonists, as well as the intact antibodies. Humanized antibodies directed 
against a CEG polypeptide are also useful. The advantage of using humanized antibodies is 
mat they are less immunogenic in humans. As used herein, a humanized antibody is an 
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immunoglobulin molecule which is capable of binding to a CEG polypeptide and which 
comprises a FR region having substantially the amino acid sequence of a human 
immunoglobulin and a CDR having substantially the amino acid sequence of non-human 
immunoglobulin or a sequence engineered to bind a CEG protein. Methods for humanizing 
murine and other non-human antibodies by substituting one or more of the non-human 
antibody CDRs for corresponding human antibody sequences are well known (Jones et al., 
1986 Nature 321: 522-525; Riechmnan et al., 1988 Nature 332: 323-327; Verhoeyen et al.,' 
1988 Science 239: 1534-1536; Carter et al., 1993 Proc. Natl. Acad Sci. USA 89: 4285;' 
and Sims et al., 1 993 J. Immunol. 15 1 : 2296). 



Use of immunologically reactive fragments, such as the Fab, Fab\ or F(ab^ fragments is 
often preferable, especially in a therapeutic context, as these fragments are generally less 
immunogenic than the whole immunoglobulin. Further, bi-specific antibodies specific for 
two or more epitopes may be generated using methods generally known in the art. Further, 
15 antibody effector functions may be modified so as to enhance the therapeutic effect of the' 
antibodies of the invention. For example, cysteine residues may be engineered into the Fc 
region, permitting the formation of interchain disulfide bonds and the generation of 
homodimers which may have enhanced capacities for internalization, ADCC and/or 
complement-mediated cell killing (Caron et al., 1992 J. Exp. Med. 176: 1191-1195; 
Shopes, 1992 J. Immunol. 148: 2918-2922). Homodimeric antibodies may also be' 
generated by cross-linking techniques known in the art (Wolff et al., Cancer Res. 53: 2560- 
2565). The invention also provides pharmaceutical compositions having the monoclonal 
antibodies or anti-idiotypic monoclonal antibodies of the invention. 



25 



The antibodies or fragments may also be produced, using current technology, by 
recombinant means. Regions that bind specifically to the desired regions of the CEG 
protein can also be produced in the context of chimeric or CDR grafted antibodies of 
multiple species origin. The invention includes an antibody, e.g., a monoclonal antibody 
which competitively inhibits the immunospecific binding of any of the monoclonal 
30 antibodies of the invention to a CEG protein. 
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Alternatively, methods for producing fully human monoclonal antibodies, include phage 
display and transgenic methods, are known andmay be used for the generation of human 
monoclonal antibodies (reviewed in: Vaughan et al., 1998 Nature Biotechnology 16: 535- 
539). For example, fully human monoclonal antibodies may be generated using cloning 
technologies employing large human Ig gene combinatorial libraries (i.e., phage display) 
(Griffiths and Hoogenboom, "Building an in vitro immune system: human antibodies from 
phage display libraries" in: Protein Engineering of Antibody Molecules for Prophylactic 
and Therapeutic Applications in Man, Clark, M. (Ed.), Nottingham Academic, pp 45-64 
(1993); Burton and Barbas, "Human Antibodies from combinatorial libraries" Id., pp 65- 
82). Fully human monoclonal antibodies may also be produced using transgenic mice 
engineered to contain human immunoglobulin gene loci as described in PCT Patent 
Application W098/24893, Jakobovits et al., published December 3, 1997 (see also 
Jakobovits, 1998 Exp. Opin Invest. Drugs 7: 607-614). ITus method avoids the in vitrl 
manipulation required with phage display technology and efficiently produces high affinity, 
1 5 authentic human antibodies. 

The antibody or fragment thereof of the invention may be labeled with a detectable 
marker or conjugated to a second molecule, such as a therapeutic agent (e.g., a cytotoxic 
agent) thereby resulting in an immunoconjugate. For example, the therapeutic agent 
includes, but is not limited to, an anti-tumor drug, a toxin, a radioactive agent, a cytokine 
a second antibody or an enzyme. Further, the invention provides an embodiment wherein 
the antibody of the invention is linked to an enzyme that converts a prodrug into a 
cytotoxic drug. 

25 Examples of cytotoxic agents include, but are not limited to ricin, ricin A-chain 
. doxorubicin, daunorubicin, taxol, ethiduim bromide, mitomycin, etoposide, tenoposide' 
vincristine, vinblastine, colchicine, dihydroxy anthracin dione, actinomycin D, diphteria 
toxin, Pseudomonas exotoxin (PE) A, PE40, abrin, arbrin A chain, modeccin A chain, 
alpha-sarcin, gelonin, mitogellin, retstrictocin, phenomycin, enomycin, curicin crotin 

30 cahcheamicin, sapaonaria officinalis inhibitor, and glucocorticoid and other 
chemotherapeutic agents, as well as radioisotopes such as 212 Bi, I31 1, 13, In, 90 Y, and i8«R e 
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Suitable detectable markers for diagnostic used include, but are not limited to, a 
radioisotope, a fluorescent compound, a bioluminescent compound, chemiluminescent 
compound, a metal chelator or an enzyme. Antibodies may also be conjugated to an anti- 
cancer pro-drug activating enzyme capable of converting the pro-drug to its active form. 
See, for example, U.S. Patent Nos. 4,952,394 and 5,716,990. 



Additionally, a recombinant protein of the invention comprising the antigen-binding 
region of any of the monoclonal antibodies of the invention can be made. In such a 
10 situation, the antigen-binding region of the recombinant protein is joined to at least a 
functionally active portion of a second protein having therapeutic activity. The second 
protein can include, but is not limited to, an enzyme, lymphokine, oncostatin or toxin! 
Suitable toxins include those described above. 



Techniques for conjugating or joining therapeutic agents to antibodies are well known 
(Arnon et al., "Monoclonal Antibodies For hnmunotargeting Of Drugs In Cancer Therapy", 
in: Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. (eds.), pp. 243-56, Alan R 
Liss, Inc. 1985; Hellstrom et al, "Antibodies For Drug Delivery", in: Controlled Drug 
Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53, Marcel Dekker, Inc. 1987; Thorpe, 
"Antibody Carriers Of Cytotoxic Agents In Cancer Therapy: A Review", in: Monoclonal 
Antibodies '84: Biological And Clinical Applications, Pinchera et al. (eds.), pp. 475-506 
(1985); and Thorpe et al., "The Preparation And Cytotoxic Properties Of Antibody-Toxin 
Conjugates", in: Immunol. Rev., 62:119-58 (1982)). Techniques for joining detectable 
markers to antibodies are also known. 

D ) PHARMACFJJ TICAL COMPOSITIONS Off THE INVENTION 



The invention includes pharmaceutical compositions for use in the treatment of microbial 
infections comprising a pharmaceutically effective amount of an anti-CEG antibody or a 
30 CEG polypeptide. 
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In one embodiment, the pharmaceutical compositions may comprise a CEG antibody, 
either unmodified, conjugated to a therapeutic agent (e.g., drug, toxin, enzyme or second 
antibody) or in a recombinant form (e.g., chimeric or bispecific). The compositions may 
additionally include other antibodies or conjugates (e.g., an antibody cocktail). 

The pharmaceutical compositions also preferably include suitable carriers and adjuvants 
which include any material which when combined with the molecule of the invention 
(e.g., an anti-CEG antibody or a CEG protein) retains the molecule's activity and is non- 
reactive with the subject's immune systems. Examples of suitable carriers and adjuvants 
10 include, but are not limited to, human serum albumin, ion exchangers, alumina, lecithin, 
buffer substances such as phosphates, glycine, sorbic acid, potassium sorbate, and salts or 
electrolytes such as protamine sulfate. Other examples include any of the standard 
pharmaceutical carriers such as a phosphate buffered saline solution, water, emulsions 
such as oil/water emulsion, and various types of wetting agents. Other carriers may also 
15 include sterile solutions, tablets including coated tablets and.capsules. Typically such 
carriers contain excipients such as starch, milk, sugar, certain types of clay, gelatin, 
stearic acid or salts thereof, magnesium or calcium stearate, talc, vegetable fats or oils, 
gums, glycols, or other known excipients. Such carriers may also include flavor and 
color additives or other ingredients. Compositions comprising such carriers are 
20 formulated by well known conventional methods. Such compositions may also be • 
formulated within various lipid compositions, such as, for example, liposomes as well as 
in various polymeric compositions, such as polymer microspheres. 

The pharmaceutical compositions of the invention can be administered using ' 
25 conventional modes of a(irninistration including, but not limited .to, intravenous, 
intraperitoneal, oral, mtralymphatic or administration directly into the tumor. 
Intravenous administration is preferred. 

The pharmaceutical compositions of the invention may be in a variety of dosage forms 
30 which include, but are not limited to, liquid solutions or suspensions, tablets, pills, 
powders, suppositories, polymeric microcapsules or microvesicles, liposomes, and 
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injectable or infusible solutions. The preferred form depends upon the mode of 
administration and the therapeutic application. 

The CEG polypeptides and proteins of this invention are found in common pathogenic 
5 bacterial species such as Streptococcus pneumoniae. This organism causes upper 
respiratory tract infections. Thus, the peptides and proteins of this invention can be used 
as immunogens-in subunit vaccines for vaccination against a pathogenic bacteria such as 
Streptococcus pneumoniae. Additionally, the ceg sequences of the invention can be used 
as DNA vaccines (U.S. Patent No. 5,736,524 and U.S. Patent No. 5,989,553). 

10 

The polypeptides and proteins of this invention can be formulated . as univalent and 
multivalent vaccines. The protein can be mixed, conjugated or fused with other antigens, 
including B or T cell epitopes of other antigens. 

15 Further, when a haptenic peptide of the proteins of the invention is used, (i.e., a peptide 
which reacts with cognate antibodies, but cannot itself elicit an immune response), it can 
be conjugated to an immunogenic carrier molecule. Conjugation to an immunogenic 
carrier can render the oligopeptide immunogenic. Examples of carrier molecules are 
tetanus toxin or toxoid, diphtheria toxin or toxoid and any mutant forms of these proteins 
20 such as CRM.sub.197. Others include exotoxin A of Pseudomonas, the heat labile toxin 
of E. coli and rotaviral particles (including rotavirus and VP6 particles). Alternatively, a 
. fragment or epitope of the carrier protein or other immunogenic protein can be used. For 
example, the happen can be coupled to a T cell epitope of a bacterial toxin. 

25 In formulating the vaccine compositions with the CEG polypeptides or proteins of the 
invention, alone or in the various combinations described, the immunogen is adjusted to 
an appropriate concentration and formulated with any suitable vaccine adjuvant Suitable 
adjuvants include, but are not limited to: surface active substances, e.g., hexadecylamine, 
octadecylamine, octadecyl amino acid esters, lysolecithin, dimethyl- 

30 dioctadecylammonium bromide), methoxyhexadecylgylcerol, and pluronic polyols; 
polyamines, e.g., pyran, dextransulfate, poly. IC, carbopol; peptides, e.g., muramyl 
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dipeptide, dimethylglycine, tuftsin; oil emulsions; and mineral gels, e.g., aluminum 
hydroxide, aluminum phosphate, etc. and immune stimulating complexes. The 
immunogen may also be incorporated into liposomes, or conjugated to polysaccharides 
and/or other polymers. 

The vaccines can be administered to a human or animal in a variety of ways. These 
include intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous, oral and 
intranasal routes of administration. Further, the vaccines can be live or inactivated 
vaccines. 



The most effective mode of administration and dosage regimen for the compositions of 
this invention depends upon the severity and course of the disease, the patient's health 
and response to treatment and the judgment of the treating physician. Accordingly, the 
dosages of the compositions should be titrated to the individual patient. 

E) USES OF THE MOLECULES OF THE INVENTION 

1) MOLECULAR WEIGHT MARKERS 

The nucleic acid molecules of the invention and their encoded proteins may be employed 
as molecular weight markers. For example, the molecular weight of each of the nucleic 
acid molecules having ceg sequences and their predicted polypeptides can be determined 
and can be used to compare against other gene sequences and proteins whose molecular 
weights are unknown. 

2) DIAGNOSTICS 

The nucleic acid molecules of the invention may be employed in diagnostic 
embodiments. For example, the presence of nucleotide sequences which are identical or 
similar to the ceg sequences of the invention may be detected within a biological sample. 
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The biological sample may include blood, serum or a swab from nose, ear or throat, may 
be determined by means of a nucleic acid detection assay. 

Nucleic acid probes or primers having sequences complementary to ceg sequences may 
be used in a hybridization assay to detect the presence of the sequences which are 
identical or similar to the ceg sequences of the invention in the biological samples. 
Typically, nucleic acids molecules obtained from a suitable biological sample are 
hybridized with labeled probes or primers. The resulting hybridized molecules are 
detected and resolved by methods well known in the art , such as Northern or Southern 
blotting, micro-array technology, or amplifying with PCR technology. Other 
hybridization techniques and systems are known that can be used in connection with the 
detection aspects of the invention, including diagnostic assays such as those described in 
Falkow et al., U.S. Pat. No. 4,358,535. 

15 Examples of the PCR technology are disclosed in U.S. Patent Nos. 4,683,202 and 
4,965,188 (incorporated herein by reference). Generally, nucleic acid molecules are 
obtained from a suitable biological source and contacted with two primers corresponding 
to the ceg sequences disclosed herein, under conditions which allow for hybridization and 
polymerization to occur. A pair of probes, one corresponding to the 5' flanking region 
20 and the other corresponding to the 3' flanking region, would be sufficient to detect the 
nucleic acid molecules of the invention in a biological sample and may be used to 
indicate the amount of bacteria present. 

Alternative methods of detecting nucleic acid molecules include, for example, in situ 
25 hybridization techniques, where a ceg probe is used to detect homologous sequences 
within one or more cells, such as cells within a clinical sample or even cells grown in 
tissue culture. As is well known in the art, the cells are prepared for hybridization by 
fixation, e.g. chemical fixation, and placed in conditions that allow for the hybridization 
of a detectable probe with nucleic acids located within the fixed cell. 

30 
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The amount of ceg sequences present in a biological sample can be quantified and 
compared to the levels in a normal or "healthy" sample. For example, ceg sequences 
present in either increased or decreased levels, compared to the levels found in the 
control sample may indicate the presence of bacteria. This information is useful for 
diagnosis of a bacterial infection that requires treatment with an antibacterial agent. 

Alternatively, the amount of CEG polypeptides present in a biological sample may be 
determined by means of an immunoassay. For example, labeled antibodies reactive 
against CEG polypeptides may be used in an immuno-reactive assay to detect the 
1 0 presence of CEG polypeptides in the biological samples. 

3 > SCREENING CANDIDATE CEG SEQUENCE'S 

a) Gene Disruption Assay 



15 



The ceg nucleotide sequences of the invention can be used to identify nucleotide 
sequences which are identical or similar to the ceg sequences that are required for 
bacterial cell viability. For example, the ceg sequences can be used in a bacterial gene 
disruption assay to screen candidate nucleotide sequences to identify sequences required 
20 for bacterial cell viability. 

The disruption assay can involve: introducing into a host cell a recombinant vector that is 
capable of integration into the host genome, where the recombinant vector, includes a 
candidate sequence that putatively encodes a cell-viability gene product (e.g., the 
25 exogenous ceg sequence); the vector integrates the candidate sequence into a target 
sequence within the host's genome (e.g., the endogenous ceg sequence); and the host cell, 
so introduced, is screened for viability. The recombinant vector preferably includes a 
selectable marker so that the introduced host cell can be screened for viability in the 
presence of a selectable agent. 

30 
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For example, Figure 1 shows a schematic representation of a gene disruption assay 
vathm a bacterial host cell. In Figure 1A, the recombinant vector, pEVP3, includes the 
CAT gene (e.g., the selectable marker chloramphenicol acetyl transferase) and an internal 
region of the ceg disrupting sequence; the internal region excludes the 5' and 3' ends of 
5 the ceg sequence. The "X" in Figure 1 indicates the recombinant P EVP3 vector undergoing 
homologous recombination with the target sequence (e.g., within the host genome). In 
Figure IB, the resolved P EVP3 vector that is integrated into the host genome, is shown. ' 
Left to right are the following elements: the native promoter of the target gene; a 5' partial 
copy of the target gene; the body of the integrated pEVP3 vector including the disrupting 
10 gene and CAT; and, a 3' partial copy of the target gene. Thus/integration of the pEVP3 
vector via homologous recombination results in two partial gene duplications flanking the 
integrated vector. If the target gene is not essential for survival, it is possible to recover 
chloramphenicol-resistant colonies of £ pneumoniae. Failure to recover chloramphenicol 
resistant colonies, in the presence of the proper controls as described below, indicates that 
15 the target gene may be essential for cell viability. 

More particularly, the gene disruption assay for screening candidate ceg sequences can 
involve the following steps. The recombinant pEVP-3.vector encoding CAT resistance 
and having a fragment of a candidate ceg sequence, can be introduced into 
20 transformation-competent S. pneumoniae cells by methods that are well-known in the art 
(Lee, M.S., et al., 1998 Appl. Environ. Microbiol. 64:4796-4802). The preferred size of 
the ceg fragment can be between about 200 to about 500 bp in length. It is advantageous 
that the candidate ceg sequence does not include the 5' and 3' ends that encode the N- 
and C-teirninal ends of the CEG polypeptide. This insures that the inserted ceg fragment 
25 and the disrupted endogenous ceg gene sequence are not capable of expression of a full- 
length, functional ceg gene product. The transformation-competent cells can be obtained 
by performing the transformation step in the presence of a heptadecapeptide that induces 
competence for transformation of £ pneumoniae (Havarstein, L. S., et al., 1995 P roc . 
Nafl Acad. Sci. 92:11140-11144), such as the CSP-1 peptide. The CSP-1 can be 
30 naturally-derived or synthetic. Additionally, the transformation step can be optimized by 
performing the transformation when the cells have reached a density which is optimal for 
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transformation (e.g., 3 X 10 7 cells per ml.) (Havarstein, L. S. et al. supra). The 
. recombinant vector can be introduced into the competent pneumococci and may undergo 
homologous recombination, whereby the candidate ceg fragment recombines with the 
corresponding endogenous ceg sequence, resulting in targeted integration of the vector 
5 into the pneumococcal genome and disruption of the endogenous ceg. 

The transformed cells can be plated on or cultured in cUoramphenicol-containing growth 
medium. The cells can be cultured under standard conditions, such as 37° C in 5% C0 2 
for approximately 40 to 48 hours, for the purpose of selecting cells that carry the 
1 0 integrated vector. 

Additionally, control samples can be run in parallel with the gene disruption assay, in 
order to determine whether the gene disruption procedure is working properly. For 
example, the control samples can be used to calibrate the gene disruption experiment so 
15 that disruption of a known non-essential bacterial gene results in an approximate number 
of colonies per plate. Similarly, the disruption of a known essential gene can be 
calibrated to yield only zero or one colony per plate. The appearance of one colony is 
due to the rare illegitimate recombination into a non-homologous sequence. In particular, 
a known non-essential gene such as the lytA gene (Tomasz, A., et al., 1988 J. Bacterid. 
20 170:5931-5934) can be used so that between about 70 to 100 chloramphenicol-resistant 
colonies will grow per plate. Similarly, ±eftsZ gene (Lutkenhaus, J. F., et al., 1980 J. 
Bacterial. 143:1281-1288), a known essential gene, can be used to yield zero or, rarely, 
one colony per plate. As is well known in the art, specific parameters that are involved in 
any given gene disruption assay can be adjusted to calibrate the desired number of plated 
25 cells in the control samples. Experimental parameters that can be adjusted include, but 
are not limited to, the R coli strain used to propagate the vector/insert, the fragment 
length of the sequence to be integrated, the amount of recombinant integration vector 
used to transform the cells, use of transformation-competent cells, and plating density of 
the transformed cells. 

30 
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The transformed cells carrying the recombinant integration vector that disrupts 
expression of an endogenous essential gene (e.g., the target ceg gene) can be identified, 
based on a selectable phenotype such as non-viability. For example, the cells that carry a 
disrupted non-essential gene will be viable and, due to the integration of P EVP3, will 
grow on crJoramphemwl-containing medium. In contrast, cells that carry a disrupted 
essential gene will not grow (e.g., non-viable) on the chloramphenicol-containing 
medium. Thus, the transformed cells that do not grow under these selective conditions 
carry an endogenous gene sequence that is essential for cell viability which has been 
disrupted by an exogenous candidate fragment, thereby identifying a ceg sequence. Steps 
one through three may be repeated in order to confirm that the ceg sequences, so 
identified, are essential for cell viability. 

b) Autolysin Assay 

15 It is advantageous to perform additional steps to determine whether the homologous 
recombination events result in disruption of the intended target gene sequence. The lytA 
transformation control can be used to confirm that the transformation system is 
functioning properly. For example, a phenotypic test for autolysin activity (lytA gene 
product) can be performed to determine that the exogenous lytA fragment is correctly 
integrated into the lytA site within the host genome. This typically involves flooding the 
culture plates containing transformants carrying the integrated lytA control vector with a 
solution of detergent, such as 0.1% deoxycholate, which triggers cell lysis in /^-intact 
cells (e.g., the cells that have not undergone homologous recombination). After about 5- 
10 minutes the colonies with intact lytA will appear ghost-like due to cell lysis, and the 
25 colonies with a disrupted lytA gene will appear intact. 

c) Polarity Analysis 

The ceg sequences that are confirmed to be essential for cell viability can be examined 
30 further by performing a polarity analysis to determine if the corresponding endogenous 
ceg sequence is organized in an operon. Polarity is an effect unique to prokaryotes and is 
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the result of the operon organization of bacterial genomes. Many bacterial genes are 
arranged in operons in which multiple genes are under the control of a single regulatory 
sequence (e.g., a promoter) and are transcribed into a single mRNA transcript. With 
respect to the orientation of multiple genes within an operon, the genes that are proximal 
to the regulatory sequence are said to be "upstream" genes and the genes that are distal 
are said to be "downstream" genes. For example, many operons contain genes encoding 
different proteins that catalyze discrete steps of a common biochemical pathway. Thus, 
any of the proteins that catalyze the steps of the pathway may be essential for cell 
viability. 



The presence of operons in a bacterial host genome may influence the interpretations of 
the gene disruption results. For example, disruption of an upstream gene may be 
erroneously interpreted as affecting the expression of the disrupted gene but may, in fact, 
have expression affects on the intact downstream genes. Therefore, it is advantageous to 
1 5 perform a polarity analysis to determine if a ceg sequence is part of an operon. 

A polarity analysis can involve performing an in vivo gene disruption procedure using, as 
the disrupting sequence, a ceg sequence that includes the entire ceg coding sequence 
region but lacking expression regulatory sequences. This differs from the gene disruption 
assay, which involves the central region of the ceg sequence. The polarity analysis 
involves gene duplication via homologous recombination. For example, the pEVP-3 
vector having the entire coding region of a ceg sequence can be used for the polarity 
analysis (Figure 2 A). The polarity analysis will yield different results depending on the 
organization of the endogenous target sequence within the host genome. 



30 



For example, Figure 2 shows a schematic representation of the polarity test for operons, 
within a bacterial host cell. In Figure 2A, the recombinant vector, pEVP3, includes the 
CAT gene and the entire coding region of the ceg disrupting sequence. The "X" in Figure 
2 indicates the recombinant pEW3 vector undergoing homologous recombination with the 
target sequence. Two of the possible results of homologous recombination are shown in 
Figures 2 B and C. 
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In Figure 2 B, case 1, if the endogenous target sequence is not organized in an operon, the 
integration event may yield: a functional target sequence (e.g., it is capable of 
expression); a duplicate non-functional target sequence that lacks a promoter; and a 

5 functional downstream gene (e.g., Gene B) that is controlled by its own promoter. The 
cells carrying this type of integrated target sequence can be recovered as viable cells that 

* grow in the presence of chloramphenicol; this condition is termed "polarity negative". 

In Figure 2 C, case 2, if the target sequence is organized in an operon, then the integration 
10 event may yield an integration site that is similar to that described for case 1 , including: a 
functional target sequence; and a duplicate non-functional target sequence which is not 
functional. However, this integration event may also yield a non-functional downstream 
gene (e.g., Gene B) because expression of this downstream gene is controlled by a 
promoter located upstream of the insertion site. The cells that carry this type of 
15 integrated target sequence will be non-viable; this condition is termed "polarity positive". 
Thus, the polarity analysis provides a method to determine whether integration of a 
recombinant vector into a target ceg sequence effects expression of downstream genes. 

The ceg sequences disclosed herein (SEQ ID NOs.: 1-113, 227-331) encode gene 
20 products that are essential for viability in & pneumoniae. Furthermore, many of these 
ceg sequences have been analyzed for the polarity effect and the results are presented in ' 
Table I. One subset of ceg sequences is classified as polarity negative (-), since the 
homologous recombination event did not effect the expression of downstream genes. 
Another subset of ceg sequences is classified as polarity positive (+), since the 
5 homologous recombination event did affect the expression of downstream genes. The 
ceg sequences that have not yet been classified as polarity positive or negative are 
indicated in Table I as a blank. For the ceg sequences that are classified as polarity 
positive, the genes downstream of the disrupted endogenous ceg sequences may or may 
riot also be essential. 
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4 > ASSAYS FOR IDENTIFYING CEG LIGANDS AND OTHV.R 
BINDING AGENTS ~ ass 

The present invention provides screening methods for identifying agents that interact 
5 and/or bind to the CEG proteins of the invention, such as a ligand. An agent can be, for 
example, a natural product, a derived or synthetic chemical molecule, a polypeptide, a 
nucleic acid molecule, or a metal. The agents that interact with CEG proteins may cause 
bacterial cell death by disrupting the functions of CEG proteins, including, but not 
limited to, nucleotide biosynthesis, DNA replication, RNA transcription, protein 
10 translation, and/or cell wall biosynthesis. Accordingly, the present invention provides 
screening methods for identifying agents having antibacterial activity, such as agents that 
cause bacterial cell death by interacting with the CEG proteins. These antibacterial 
agents are useful for treating diseases and afflictions associated with bacterial infections. 

15 Various methods can be used to discover agents having antibacterial activity, as 
determined by the ability of the binding agent to bind to a CEG protein and disrupt the 
function of the CEG protein. These screening methods include whole cell in vivo assays 
as well as in vitro assays with cellular components. 

20 An in vivo screening method for identifying ligands that bind CEG polypeptides can be 
performed in a whole cell assay. A typical method may be the use of whole bacterial 
cells to assess the antibacterial properties based on cell growth or viability. These 
methods can include methods for measuring cell growth and/or viability, for example, by 
optical density or zones of growth (Koch, A. L. et al., 1970 Anal. Biochem. 38:252-259; 
25 Biemer, J. J. et al., 1973 Ann. Clin Lab. Sci. 2:135-140; Manual of Clinical 
Microbiology; 7 th edition, Murray, P. R. (ed), ASM Press), by growth inhibition in an 
agar assay (Murray, P. R., supra), or other means of detecting cell metabolism 
(Mychajlonka, M. et al., 1980 Antimicrob. Agents Chemother. 17:572-582), and are well 
known to those skilled in the art. In addition, there are molecular biology-based detection 
30 methods for use with whole bacterial cells, such as gene reporter assays, to monitor the 
effect of the ligand on specific targets (Slauch, J. M., et al., 1991 Methods Enzymol 
204:213-248). Examples of the reporter genes include, but are riot limited to, beta- 
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galactosidase, alkaline phosphatase, luciferase, and green fluorescent protein. For 
example, one embodiment provides a reporter system that monitors inhibition of DNA 
synthesis by fusing a reporter such as beta-galactosidase (lacZ) to genes known to be 
upregulated by the cessation of DNA synthesis as a result of the binding of ligands to the 
5 DNA synthetic apparatus. (Shurvinton, C. E., et al., 1982 Mol. Gen. Genetics 185:352- 
355; Rosato, A., et al., 1998 Antimicrob. Agents Chemother. 42:1392-1396). 

Alternatively, the yeast two-hybrid system (Fields, S. and Song, O. 1989, Nature 
340:245-246) may be adapted to screen for ligands that bind CEG polypeptides. Generally, 
1 0 the yeast two-hybrid system is performed in a yeast host cell carrying a reporter gene, and 
is based on the modular nature of the GAL transcription factor which has a DNA binding 
domain and a transcriptional activation domain. The yeast two-hybrid system relies on 
the physical interaction between a recombinant polypeptide that comprises the GAL 
DNA binding domain and another recombinant polypeptide that comprises the GAL 
1 5 transcriptional activation domain. The physical interaction between the two recombinant 
polypeptides reconstitutes the transcriptional activity of the transcription factor, thereby 
causing expression of the reporter gene. Either of the recombinant polypeptides used in 
the two-hybrid system can be generated to include a CEG polypeptide sequence to screen 
for binding partners of CEG. 

20 

Another method uses the bacterial CEG proteins as the basis for in vitro assay systems to 
detect binding agents. Typically, the in vitro screening method comprises: a) generating 
the CEG protein of the invention, or membranes enriched in the CEG protein; b) 
exposing the CEG protein or membranes to a candidate agent; and c) detecting the 
25 interaction of the CEG protein with the agent by any suitable means. Additionally, the 
screening methods may be adapted to automated high-throughput procedures, such as 
PANDEX.RTM Baxter-Dade Diagnostics, allowing for efficient high-volume screening 
of candidate agents. 

30 An alternative method for screening potential ligands involves an in vitro binding 
procedure. Typically, the CEG proteins of the invention can be produced using 
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recombinant DNA technology and host-vector systems as described herein. A candidate 
agent is introduced into a reaction vessel containing the CEG protein, of fragment 
thereof; the candidate agents may be detectable by methods such as, but not limited to, 
radioisotope or chemical labeling. Binding of the CEG protein by a candidate agent can' 
be determined by any suitable means, including, for example, quantifying bound label 
versus unbound label using any suitable method. Binding of a candidate agent may also 
be detected by methods similar to an alternative physical method disclosed in U.S. Patent 
No. 5,585,277. In this method, binding of a candidate agent to a protein is assessed by 
monitoring the ratio of folded protein to unfolded protein, for example by monitoring 
sensitivity of the protein to a protease, or amenability to binding of the protein by a 
specific antibody against the folded state of the protein, or binding to chaperone protein, 
or by binding to any suitable surface. 

The invention provides methods of identifying compounds that modulate (e.g., activate or 
15 inhibit) the function of a CEG polypeptide. Essentially any compound can be used in the 
assays of the invention. The preferred compounds are those that are soluble in aqueous 
or organic solutions. It will be appreciated by those of skill in the art that there are many 
commercial suppliers of chemical compounds that can be used in the methods of the 
invention, including Sigma Chemical Co. (St. Louis, Mo.), Aldrich Chemical Co. (St. 
20 Louis, Mo.), Sigma-Aldrich (St Louis, Mo.), Fluka Chemika-Biochemica Analytika 
(Buchs, Switzerland), and the like. 

The present invention provides methods for detecting compounds which are identified as 
modulators of CEG function. The methods of the invention can be performed using 

25 isolated CEG polypeptides, or use whole cells expressing the CEG polypeptide. The 
steps, of the method using isolated CEG polypeptides include: contacting the isolated 
CEG polypeptide with a candidate compound; and deterniining whether the function of 
the CEG polypeptide is altered. The steps of the method using whole cells include: 
contacting the whole cells with a candidate compound; and detennining whether the cell 

30 dies, indicating the compound inhibited the function of a CEG polypeptide. 
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The preferred methods of the invention provide high-throughput screening assays for 
identifying compounds which modulate the function of a CEG polypeptide. The high 
throughput methods permit screening of large libraries of compounds. For example the 
high throughput methods can use automated assay steps. The assays can be performed 
5 in parallel on a solid support, as microtiter formats on microliter plates in robotic assays 
are well known. A preferred embodiment of the methods includes adapting the methods 
to use microtiter plates or pico- nano- or micro-liter arrays. In high throughput assays it 
is desirable to run positive controls to ensure that the components of the assays are 
working properly. 

10 

The high throughput screening methods of the invention include providing a 
combinatorial library containing a large number of compounds (candidate modulator 
compounds) (Borman, S, C & E. News, 1999, 70(10), 33-48). Such combinatorial 
chemical libraries can be screened in one or more assays to identify library members 
15 (particular chemical species or subclasses) that exhibit the ability to modulate the 
function of the CEG polypeptide (Borman, S., supra; Dagani, R. C. & E. News, 1999, 
70(10), 51-60). The compounds, so identified, can serve as lead-compounds or can 
themselves be used as potential or actual therapeutics. 

20 A combinatorial chemical library is a collection of diverse chemical compounds 
generated by using either chemical synthesis or biological synthesis, to combine a 
number of chemical building blocks, such as reagents. For example, a linear 
combinatorial chemical library, such as a polypeptide library, is formed by combining a 
set of chemical building blocks (amino acids) in every possible way for a given 

25 compound length (i.e., the number of amino acids in a polypeptide compound). Millions 
of chemical compounds can be synthesized through such combinatorial mixing of 
chemical building blocks. 

Preparation and screening of combinatorial chemical libraries is well known to those of skill in 
30 the art. Such combinatorial chemical libraries include, but are not limited to, peptide libraries 
(see, e.g., U.S. Pat No. 5,010,175, Furka, Int. J. Pept Prot Res., 1991, 37:487-493 and 



67 



WO 01/49721 



PCT/USOO/35604 



Houghton, et al., Nature, 1991, 354, 84-88). Other chemistries for generating chemical 
diversity libraries can also be used. Such chemistries include, but are not limited to, peptoids 
(PCT Publication Np. WO 91/19735); encoded peptides (PCT Publication WO 93/20242); 
random bio-oligomers (PCT Publication No. WO 92/00091); benzodiazepines (U.S. Pat. No. 
5 5,288,514); diversomers, such as hydantoins, benzodiazepines and dipeptides (Hobbs, et al., 
Proc. Nat. Acad Sci. USA, 1993, 90, 6909-6913); vinylogous polypeptides (Hagihara, et al., J. 
Amer. Chem. Soc. 1992, 114, 6568); nonpeptidal peptidomimetics with fo>ta-D-glucose 
scaffolding (Hirschmann, et al., J. Amer. Chem. Soc, 1992, 114, 9217-9218); analogous 
organic syntheses of small compound libraries (Chen, et al., J. Amer. Chem. Soc, 1994, 116, 
10 2661; Armstrong, et al. Acc. Chem. Res., 1996, 29, 123-131); or small organic molecule 
libraries (see, e.g., benzodiazepines, Baum C&E News, 1993, Jan. 18, page 33,); 
oligocarbamates (Cho, et al., Science, 1993, 261, 1303); and/or peptidyl phosphonates 
(Campbell, et al., J. Org. Chem. 1994, 59, 658); nucleic acid libraries (see, Seliger, H et al., 
Nucleosides & Nucleotides, 1997, 16, 703-710); peptide nucleic acid libraries (see, e.g., U.S. 
15 Pat. No. 5,539,083); antibody libraries (see, e.g., Vaughn, et al., Nature Biotechnology, 1996, 
14(3), 309-314 and PCT/US96/10287); carbohydrate libraries (see, e.g., Liang, et al., Science, 
1996, 274, 1520-1522 and U.S. Pat. No. 5,593,853, Nilsson, UJ, et al., Combinatorial 
Chemistry & High Throughput Screening, 1999 2, 335-352; Schweizer, F; Hindsgaul, O. 
Current Opinion In Chemical Biology, 1999 3, 291-298); isoprenoids (U.S. Pat. No. 
20 5,569,588); thiazolidinones and metathiazanones (U.S. Pat. No. 5,549,974); pyrrolidines (U.S. 
Pat. Nos. 5,525,735 and 5,519,134); morpholino compounds (U.S. Pat. No. 5,506,337); 
benzodiazepines (U.S. Pat. No. 5,288,5 14); and other similar art. 

Devices for the preparation of combinatorial libraries are commercially available (see, 
25 e.g., 357 MPS, 390 MPS, Advanced Chem. Tech, Louisville Ky., Symphony, Rainin, 
Woburn, Mass., 433A Applied Biosystems, Foster City, Calif., 9050 Plus, Millipore, 
Bedford, Mass.). In addition, numerous combinatorial libraries are themselves 
commercially available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, 
Tripos, Inc., St. Louis, Mo., ChemStar, Ltd., Moscow, RU, 3D Pharmaceuticals, Exton, 
30 Pa., Martek Bio sciences, Columbia, Md., etc.). 
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In the high throughput methods of the invention, several thousand different candidate 
compounds can be screened in a relatively short period of time. For example, each well 
of a microtiter plate can be used to run a separate assay against a selected potential 
modulator, or if concentration or incubation time effects are to be observed, every 5-10 
5 wells can test a single modulator. Thus, a single standard microtiter plate can assay about 
100 (96) modulators. If 1536 well plates are used, then a single plate can easily assay 
from about 100 to about 1500 different compounds. It is possible to assay many different 
plates per day; assay screens for up to about 6,000-20,000, and even up to about 1 00,000- 
1,000,000 different candidate modulator compounds are possible using the methods of 
10 the invention. * 

The following examples are presented to illustrate the present invention and to assist one of 
ordinary skill in making and using the same. The examples are not intended in any way to 
otherwise limit the scope of the invention. 

15 

EXAMPLE 1 

The following provides a general description of how a list of candidate ceg sequences 
was generated. The list was generated by selecting candidate ceg gene sequences from a 
20 Concordance web engine using the method described in: Bruccoleri, R.E., Dougherty, 
T.J., Davison, D.B. (1998) "Concordance analysis of microbial genomes" in: Nucleic 
Acids Res 26:4482-4486. 

Microbial Genomics CEG Discovery Process Summary. 

25 

Microbial Concordance Analysis 

The entire genomic sequence data of various bacteria was acquired from several public 
and proprietary sequence database sources, including GTC (Genome Therapeutics 
30 Corporation), and TIGR (The Institute for Genomic Research). 
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Predicted ORFs from the genomic data were identified, translated, and stored. The 
desirable ORFs were at least 90 amino acid residues in length. Concordance analysis 
was performed among bacteria and various parameters were used to filter out genes with 
high similarity to eukaryotes. 

5 

Concordance Analysis 

The entire genomic sequence of various Eubacteria was acquired from several public and 
private sources. The proprietary PathoGenome System from Genome Therapeutics 

10 Corporation, Waltham, MA, USA contributed data. Public data was obtained from 
GenBank (http://ncbi.nlm.nih.govl The Institute for Genomic Research (TIGR), the 
Yeast Proteome Database, from Proteome, Inc. of Beverly, MA, and the Sanger Center of 
the Medical Research Council of the United Kingdom (http://www.sanger.ac.uk). 
Additionally, the non-microbial sequence data used as a basis for comparison and data 

15 subtraction was obtained from a proprietary database, including the LifeSeq Database 
from Incyte Pharmaceuticals, Palo Alto, CA. 

Where required, Incyte nucleotide sequences were translated into protein sequences in all 
six possible reading frames. GTC supplied predicted protein sequences with their data. In 
20 the case of other eubacterial nucleotide sequences, the program CRITIC A (Badger, J. and 
Olsen, G., 1999 "CRIT1CA: coding region identification tool invoking comparative 
analysis" in: Molecular Biology and Evolution 16:512-524). The sequences were stored 
in flat files on a Unix computer system. Each predicted amino acid sequence had to be 
greater than 90 amino acids. 

25 

Each predicted protein sequence was compared to every other sequence (an "all-against- 
all" comparison). The program used was FASTA (Pearson, W.R., "Flexible sequence 
similarity searching with the FASTA3 program package." Methods in Molecular Biology 
2000 132:185-219.) The parameters used were ktup=2, and all scores above the default 
30 cutoff were kept. The output was processed and stored in a PostGres 95 database 
(http://www.postgresql.orgl Graphical user interfaces, using web browser technology, 
were constructed to query the database. 
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A Concordance Analysis was performed on the data. The question used to generate the 
dataset was show all Streptococcus pneumoniae open reading frames with a similarity 
5 greater than or equal to 30% overall protein sequence identity to both selected gram- 
positive and/or gram-negative bacteria in the database. The data was further required not 
to match yeast or human sequences at greater than 30% overall protein sequence 
similarity. The resulting dataset included a list of more than 400 conserved amino acid 
sequences having known or unknown function. The amino acid sequences having 
10 unknown functions formed the basis of a list designated Conserved Unknown Reading 
Frames, or CURFs which is a subset of the total list of CEGs (e.g., CURFs includes 
known and unknown). 

The resulting list of conserved genes (e.g., more than 400 sequences) was used as a basis 
15 for selecting and screening bacterial gene sequences that are essential for cell viability. 
The Concordance system was designed to permit high-throughput identification of 
conserved gene sequences in the database. (Bruccoleri, R, Dougherty, T, and Davison, D. 
1998 "Concordance analysis of microbial genomes" Nucleic Acids Res. 26:4482-4486.) 

20 Data Curation And Analysis 

Exact N-terminal and C-terminal translational start sites of genes were identified by 
pairwise similarity searches, multiple sequence alignments. Ribosome binding sites, 
terminators, nearby genes, operons were identified. 

25 

The resulting list of conserved genes was used as a basis for selecting and screening 
bacterial gene sequences that are essential for cell viability. This Concordance system 
was designed to permit high throughput use of the conserved gene sequences contained . 
on the list. A set of Knockout PCR primers were generated, based on the list of 
30 conserved genes, for the purpose of use in the gene disruption procedure described 
below. The PCR primers were designed to amplify a central 300-500 bp region of the 
ceg (to prevent generation of a functional copy of the ceg gene following integration), 
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ordered electronically, the primers were placed in a 96-well format, and used in the gene 
disruption procedure as described below. 

EXAMPLE 2 

The following provides a description of the procedure to generate recombinant vectors of 
pEVP-3 having inserts of candidate ceg nucleotide sequences. The Knockout primers 
generated by the method described in Example 1 above were used to generate DNA 
fragments comprising candidate ceg sequences. 

Genomic PCR Knockout Target Fragment Generation 

96-well plate format were set up (36 \il H 2 0 , 5 nl 10* Vent™ buffer, 1 jil gene specific, 
knockout forward primer (0.5 \igf\i\), 1 yl gene specific knockout reverse primer (0.5 
fig/jil), 0.5 jil Vent™ DNA polymerase (2000 U/ml New England Biolabs, Beverly, 
MA), 1.5 \xl each dNTPs (lOmM; 6.0 jxl total), 0.5 \xl S. pneumoniae chromosomal DNA 
(0.5 jag/jal), 50 ^1 total volume/reaction); 

The nucleotide sequences of the forward and reverse knockout primer pairs were 
generated from the nucleotide sequence information obtained from the Genomic 
Therapeutics Corporation database for Streptococcus pneumoniae. The primer pairs were 
each used in a PCR reaction to generate a unique internal (e.g., central region) fragment 
of the candidate gene targeted for knockout. 

The PCR program was set in the PCR machine (Initial 95 °C - 5 minutes: 30 Cycles of: 
95 °C - 1 minute, 58 °C - 1 minute, 72 °C - 30 seconds; Final, 72 °C - 10 minutes, 4 °C - 
hold indefinitely). 5 \il of each reaction was run on an 0.8% agarose gel after purifying 
fragment over PCR purification kit (Qiagen) to visualize the fragments then ligation 
reactions were performed. 
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Ligation Reactions proceeded (set up in. 96-well plate format (10.0 |il genomic PGR 
fragment (generated from step 2 above), 1.0 jil pEPV-3 Smal-cut vector (1:10 dilution of 
vector DNA at 50-100 ng/|al), 1.5 ^1 10x ligation buffer (New England Biolabs™), 1.0 ^1 
T4 DNA Ligase (New England Biolabs™ 400,000 U/ml), 1.5 \\l ddH 2 0, 15.0 jal total 
reaction volume). 

Reactions were allowed to incubate in 96-well plate at 14 °C overnight in the PCR 
machine. Transformations into E. coli for in vivo amplification were proceeded the 
following day. 

The nucleotide sequences of the forward and reverse primer pairs used for the polarity 
test were generated in a similar manner, from the nucleotide sequence information 
obtained from the Genomic Therapeutics Corporation database for Streptococcus 
■pneumoniae. The primer pairs were each used in a PCR reaction to generate a unique 
fragment of the candidate gene targeted for the polarity test. The fragment generated for 
the polarity test included the entire ceg coding sequence region but lacking the expression 
regulatory sequences. 

Transformation into E. coli (strain LE392): 

The next day, 3 ^1 of above ligation mix was used per transformation reaction plus 50 (il 
LE392 competent cells. Reactions were set up in 96-well plate format; incubated on ice 
for 30 minutes; heat-shocked at 42° C for 90 seconds; and incubated on ice 2 minutes; 
100 |il SOC media (Gibco BRL) was added; then incubated at 37° C on platform shaker 
for 1 hour; plated on LB/chloramphenicol (13.0 jag/ml) agar plates for constructs over 
night at 37° C with plates inverted and proceeded with colony PCR to confirm constructs. 
The universal primers flanking the insert site in pEVP-3 were used for PCR 
amplification. 

The colony PCR involved the following. 96-well plate format was set up (36.5 \il H2O, 
0.5 yl pEPV3 forward primer (0.25 iigf\xl), 0.5 ^1 pEPV3 reverse primer (0.25 fig/jil), 1.5 
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\il each (6.0 \il total) dNTPs (10 mM), 0.5 ^il Vent™ DNA polymerase, 5 |nl 10* Vent™ 
buffer, 1 fil of a 1:50 cell dilution, 50 |il total volume). 

pEPV3 forward primer: 5' CATCAAGCTTATCGATACCGTCG 3' (SEQ ID NO:437) 
5 p EPV3 reverse primer: 5' CACAGTAGTTCACCACCTTTTCCC 3' (SEQ ID NO:438) 

Colonies of E. coli LE392 .were picked onto a master plate of LB + 13 jig/ml 
chloramphenicol (incubate throughout the day at 37° C) and then into 50 \x\ H 2 0 which 
has been placed into a 96-well plate. 1 \x\ of this dilution was used in above PCR reaction 
1 0 (if the 96-well dilution plate is kept you will not need to prepare a master plate). Cultures 
for minipreps of plasmid candidates may be prepared directly from the cell dilutions. 

The PCR program was run (95 °C - 5 minutes, 30 Cycles of: 95 °C - 1 minute, 58 °C - 1 
minute, 72 °C - 30 seconds, 72 °C - 10 minutes, 4 °C - hold). 

15 

A 10 til/ reaction was run on a 1.0 % TBE gel. A gel designed for 96 well plates and a 
multichannel pipettor were used to ease loading of the sample rows. The gel was run and 
stained with ethidium bromide. The positive clones were identified with appropriate 
molecular size insert(s), amplified by the flanking pEVP-3 primers. 

20 

Minipreps Of Plas mids To Identify Cells Carrying The Pevp-3 Vector With An Insert 

The constructs that carried an insert were identified. The constructs having an insert 
were inoculated into a 5 ml LB/Cm culture, and incubated over night at 37 °C with 
25 aeration. Miniprep plasmid DNA was prepared by a standard procedure. The miniprep 
DNA was digested with appropriate restriction enzymes to confirm the presence of the 
insert (enzymes flank Smal site in pEVP-3) (10 fal miniprep DNA, 2 jil 10 * buffer, 1 jil 
Xbal, 1 jil Xhol, 6 ^1 ddH20; 20 fal total volume for digest). 
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To confirm the presence of an insert, the digest reactions were electrophoresed on an 
agarose gel and the gel was stained with ethidium bromide. The positive clones were 
used for the S. pneumoniae KNOCKOUTs procedure. 

The confirmatory PCR reactions, using knock out-specific primers (quality control step) 
involved 35.5 |il H 2 0, 5 \xl 10 * Vent™ buffer, 1 \xl knockout forward primer (0.5 |ig/fil), 
1 |il knockout reverse primer (0.5 Hg/|il), 0.5 |il Vent™ (6.0 jil total) DNA Polymerase 
(2000 U/ml), 1.5 \il each dNTPs (lOmM, 6.0 |xl total), 1.0 \il miniprep DNA from test 
clone, 50 \x\ total reaction volume. The PCR program was as follows: 95 °C for 5 
minutes, 30 Cycles of: 95 6 C for 1 minute, 60 °C for 1 minute, 72 °C for 30 seconds, 72 
°C for 10 minutes, hold at 4 °C. The presence of the correct-sized insert was confirmed 
by agarose gel electrophoresis and ethidium bromide staining. The confirmed clones 
were used for the S. pneumoniae gene KNOCKOUT procedure. Glycerol stocks were 
made of all positive E. coli LE392 constructs and frozen at - 80 degrees C. 

EXAMPLE 3 

The following provides a description of the high throughput gene disruption procedure 
used in S. pneunomiae strain (e.g., gene knockout procedure). The candidate ceg 
fragments that were generated by the method described in Example 2 were used in the 
gene disruption procedure in order to identify ceg nucleotide sequences that are required 
for cell viability. 

Reactions were set up in a 1.5 ml eppendorf tubes or 96 well plate (1 jig total of miniprep 
pEVP-3 + insert DNA (usually 10 fil of Qiagen miniprep DNA); then 200 [il of S. 
pneumoniae (strain Rx-1) competent cells diluted 1:10 in competence media was added 
(1 ml of competence media = 980 pi Todd Hgqvitt (Difco Laboratories) with 0.5% yeast 
extract, 20 \xl 10% BSA, 1 |il 10 % CaC12, and 0.5 \il (200 |ag/ml) Csp-1 competence 
peptide). 
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Controls were run with each KNOCKOUT experiment and involved 1 jig pEPV3 Lyt A 
construct = positive control (non-essential), or 1 pEPV3 Fts Z construct = negative 
control (essential). Then the 96 well plates and controls were incubated at 37 °C for 2.5 
to 3 hours in 37 °C room without shaking. The 200 of the samples were plated on 
5 Todd Hewitt agar plates with 0.5% yeast extract and 2 \ig/ml chloramphenicol. 

The samples were incubate over night at 37 °C in 5% C0 2 incubator. Control plates were 
checked for presence of colonies (pEVP-3::lytA) and no growth (pEVP-3::ftsZ). Plates 
were examined for growth (ca. 70-150 colonies) designating nonessentials and zero 
1 0 colonies designating essential genes. 

The polarity test was performed in a similar manner, using the polarity fragments, 
described in Example 3. 

15 

EXAMPLE 4 

The following provides a description of the autolysin procedure used to determine that 
the non-essential control samples of S pneumoniae contain a disrupted lyt A gene. 

20 

Phenotvpic Autolysin Test 

The culture plates containing transformants carrying the lytA control vector were flooded 
25 with 0.1% deoxycholate in H 2 0. The plates were observed after 5-10 minutes. Plates 
with "ghosts" indicated intact lytA gene, or plates without "ghosts" indicated a disrupted 
lyt A gene. The "ghost" phenomenon is due to detergent triggered autolysis of the cells, 
causing a gradual fading of the colonies. 

30 The detergent treatment triggers the autolysin in lytA intact cells; it cannot trigger the 
autolysin (lytA gene product) in lytA disrupted cells. Colonies with intact lytA "ghost" in 
5-10 minutes due to massive pneumococcal cell lysis. 
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EXAMPLE 5 

The following provides a description of the procedure used to express the CEG proteins 
5 (e.g., designated CFE proteins) in E. coli cells. 

CEG Protein Production 

Full-length ceg gene were inserted into pET-21 expression vector using the E. coli BL21 
10 XDE3 expression system using the following method: 

For each ceg, custom primers were used to insert N- and C- termini into vectors such that 
the 5' end (N-terminus of the CEG) is positioned properly for expression behind the T7 
promoter and optimally placed with regard to the pET ribosome binding site. The pET 
15 vectors contain an Ndel site which allows positioning of ATG start site in the vector. In 
cases where the ceg sequence contains an internal Ndel site, blunt ligation of the ceg PCR 
fragment into the vector is accomplished via Klenow fill-in of the Ndel site. In many 
cases, primers were also designed such that the ceg 3' (C-terminus of the expressed 
protein) will contain an in-frame extension of 6X-histidine residues, encoded in the 
20 vector sequence of pET-21. The individual cegs were PCR amplified via custom 
designed primers as described above. Both ceg PCR and vector DNA were digested with 
appropriate restriction enzymes. The full-length ceg were ligated into the pET 
expression vector. The ligation mixture was transformed into competant K coli BL21 
ADE3 cells and selected for transformants on LB agar with 50 jig/ml ampicillin. Positive 
25 insert bearing clones were screened via minipreps of the plasmids and size analysis on 
0.8% agarose gels, with detection by ethidium bromide staining, as above. 

Protein Production 

30 The proper reading frame of each ceg inserted into pET-21 is verified by DNA 
sequencing. 
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A small (2-5 ml) test culture of £ coli BL21 ADE3 with the insert-bearing plasmid is 
tested for protein expression by IPTG induction of the expression vector for 1-2 hours. 
The expression is verified by SDS-Polyacrylamide Gel Electrophoresis analysis of a 
5 whole cell extract (SDS extract of 0.5-1 ml of cells treated at 100 °C for 5 minutes) to 
determine whether the protein is over-expressed and migrates at the correct predicted 
molecular weight. 

The protein is overproduced and purified, via the following method. A large scale (500- 
10 1000ml) culture of E. coli is grown to early logarithmic phase in broth (e.g., LB broth) 
and protein expression induced for 2 hours with IPTG (isopropyl-D-thiogalactoside). 
The cells are harvested by centrifugation (8000 X G; 15 minutes) and the cell pellets 
resuspended in 20 ml. of buffer. The cells are lysed by sonication, and the supernatant 
fluid centrifuged at low speed (5000 X G, 15 min.) to remove unbroken cells; The 
15 supernatant fluid, containing the over-expressed protein is subjected to Ni- NTA affinity 
column chromatography (Quiagen, Inc., Chatsworth, CA). The 6X-histidine residues 
linked at the C-terminal end of the CEG proteins permit rapid protein purification via 
selective binding to a Ni-NTA resin column. The protein-bound Ni-NTA resin was to 
remove contaminants, and the bound proteins subsequently eluted with imidazole and 
20 recovered. It is possible to upscale this procedure to larger volumes for higher yields of 
proteins. 

EXAMPLE 6 

25 The following provides a description of the methods used to purify all 2CEG 
polypeptides (e.g., 2CFE polypeptides #19-117; SEQ ID NOS:349-436) having a 
histidine tag at their C-terminal ends. The 2CEG polypeptides having the his-tags were 
produced by the methods described in Example 5, supra As an example, results of 
purification of 2CFE 75 polypeptide are presented. 

30 
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The BL21XDE3 cells harboring recombinant pET-21 vectors carrying a 2CFE nucleotide 
sequence (SEQ ED NOS:244-331) were cultured in LB broth containing ampicillin. 
When the A600 reached approximately 0.6, protein production was induced by adding 1.0 
mM of IPTG, the cells were cultured for an additional 2 hours. The cell pellet was 
collected by centrifugation, and the collected cell pellet was sonicated in Solution A (50 
mM NaP0 4 ; 300 mM NaCl, pH 8.0). The sonicated cells were centrifuged at 10,000 
■ RPM to remove the debris. 

Purification Of The CFE Polypeptide 

The supernatant was diluted with Solution A, loaded onto a Ni-NTA column (Quiagen) 
equilibrated with Solution A; the column bed size was 2.5 x 25 cm, and the flow rate was 
approximately 3.0 ml/minute. The 2CFE protein was eluted using a linear gradient of 
imidazole, using 0-250 mM in 450 ml, flow rate approximately 3.0 ml/minute. The 
eluted samples were collected as 22 ml fractions per tube and the eluted samples were 
monitored using spectrophotometry. The amount of protein in the eluted fractions was 
estimated using the Bradford method (Bradford, M. M., 1976 Anal Biochem. 72:248) and 
the samples were run on an SDS-PAGE gel (Novex EC6008) (Figure 3 A). Fractions 
were selected for pooling based on the results of the SDS-PAGE gel. The pooled 
fractions were concentrated using a 10,000 MW Centricon (Amicon) to approximately 5 
ml. 

The 2CFE 75 polypeptide, a precipitate formed and was redissolved upon increasing the 
sample volume and removing the imidazole by repeated concentration in 50 mM Tris, 
100 mM NaCl, pH 7.5. Varying amounts of the 2CFE 75 polypeptide were diluted in 
either 20 mM Tris, 20 mM KC1, pH 7.5 or 20 mM Tris; 20 mM MgCl 2 , pH 7.5 at 
- concentrations of 12, 24, or 36 ug/ml. The diluted samples were electrophoresed on an 
SDS-PAGE gel under non-reducing conditions (Figure 3 B). The results of Figure 3 B 
suggests that 2CFE 75 forms a multimer. 
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EXAMPLE 7 

the following provides a description of the methods used to purify CEG polypeptides 
5 that lack a histidine tag (e.g., 2CFE polypeptides #1-17; SEQ ID NOS:332-348). As an 
example, the results of purification of CFE 3 polypeptide are presented. 

Purification of the CFE 3 Polypeptide 

10 The 2CFE 3 polypeptide was produced using the large scale IPTG-induced method 
described in Example 5, supra. The 2CFE 3 (SEQ ID NO:334) polypeptide lacks a C- 
terminal histidine tag. The 2CFE 3 polypeptide was purified using a 2-column 
procedure. The 2CFE 3 polypeptide preparation was eluted from a 26/10 Q Sepharose 
column (Pharmacia) using a 0-1.0 M NaCl gradient, 2 ml/minute flow rate, and the 

15 gradient size was 1 liter. Then the 2 CFE 3 polypeptide was eluted from a hydroxy apatite 
Bio-gel column (Bio-Rad) using a 5-200 mM potassium phosphate (pH 8.0) gradient, the 
flow rate was 0.3 ml/minute, and the gradient size was 300 ml A sample of the 2CFE 3 
preparation was run on a polyacrylamide gel (Figure 4). 

20 EXAMPLE 8 

The following provides a description of the size exclusion chromatography methods used 
, to estimate the molecular weight and determine whether the CEG polypeptides 
oligomerize. The CFE polypeptide may olimerize to form monomers, dimers, tetramers, 
25 hexameric rings, or other oligomeric forms. 

Size exclusion chromatography was performed on all isolated 2CFE polypeptides #s 1- 
1 17 (e.g., SEQ ID NOS:332-436). This method was performed using various types of 
columns, depending on the particular 2CFE polypepeptide tested. 

30 
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The Biosil SEC- 125 HPLC Gel Filtration column (BioRad Laboratories, Inc) was used, 
for example, to characterize CFE 8. The mobile phase was 0.2 M KH 2 P0 4 , 0.9% NaCl 
pH 6.8. 

The Phenomenex 600 x 7.5 mm Biosep SECS 3000 column was used, for example to 
characterize 2CFE 21 and 39. The mobile phase for size exclusion was 50 mM 
Na 2 HP0 4 , pH 7.0 and 150 mM NaCl run at 1 ml/minute in a Gilson HPLC system, with 
protein detection at 280 nm. 

EXAMPLE 9 

The following provides a description of the computer-aided methods used to search for 
similarities between the amino acid sequences of the CEG polypeptides and sequences 
available through public and proprietary databases. In many cases, the function of the 
CEG polypeptides was suggested by the results of the similarity searches. The function 
of some of these CEG polypeptides has been confirmed by performing additional 
analyses. Table V provides a list of the suggested and confirmed functions of CEG 
polypeptides designated CFEs #1-117. 

The- putative function of the CFE polypeptides were determined using computer-aided 
bioinformatic approaches, including distant homologies, motif searching, or predictions 
based on statistical rules. For example, the distant homology approach involved pairwise or 
multiple sequence alignments, employing tools such as FASTA, and Psi-BLAST. The 
motif searching approach involved using sophisticated hidden Markov models. The 
approach based upon predictions of statistical rules involved prediction of transmembrane 
regions, coiled-coil, and other structural motifs. These approaches have been reviewed in 
Computational Methods In Molecular Biology 1998, eds. Salxber, S.L., Searls, D.B. Searls, 
and Kasif, S. , Elsevier, and in Bioinformatics: A Practical Guide To The Analysis Of Genes 
And Proteins 1998 eds Baxevanis, A. D. and Francis Ouellete, B.F. , Wiley-Interscience. 
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Global sequence similarity searches were performed using the amino acid sequences of 
all the conserved essential gene sequences (e.g., CFEs 1-117; SEQ ID NOS:114-226) to 
search against a non-redundant protein database using the BIAST2 algorithm (Altschul 
S.F., et al., 1997 Nucleic Acids Res. 25(17):3389-3402). In a similar search, similar 
sequences were identified in the Concordance database using the "Neighbor" function 
(Bruccoleri R. E., Dougherty T.J., Davison D.B. 1998 Nucleic Acids Res, 26(19):4482- 
4486). To determine if the predicted amino acid sequences were full length and in the 
proper reading frame, BLAST-type searching and CLUSTAL multiple sequence 
alignments (Higgins D.G., et al., 1996 Methods Enzymol 266:383-402) were used. 
Local sequence similarity searches were performed, by searching for Prosite (Hofinann 
K., et al, 1999 Nucleic Acids Res. 27(1):215-219) and Pfam motifs (Bateman A. 5 et al., 
2000 Nucleic Acids Res. 28(l):263-266). Additionally, the amino acid sequences of the 
CFEs were analyzed by performing protein threading analyses using the ProCeiyon fold 
recognition program (Sippl, et al., 1992 Proteins 13:258-271; Sippl, J. 1993 J. Comp. 
Aided Mol. Design 7:473-501; www.proceryon.com) and Geneformatics. 

In bacteria, many operons include genes encoding different proteins that catalyze discrete 
steps of a common biochemical pathway. Therefore, the operon structures in S. 
pneumoniae was compared with that in other bacteria in order to predict the function of 
CFE polypeptides. 

Additionally, analysis of bacterial metabolic pathways were performed using Pathway 
Tools from DoubleTwist, based on the EcoCyc system (Karp P.D., et al., 1999 Nucleic 
Acids Res. 1999 27(l):55-58). This analysis was used to predict which CFEs mediate 
various steps of the pathways. 

When the sequence identity between a CFE polypeptide and the annotated database (e.g., 
SwissProt, Genbank) was low (e.g., sequence identity less than about 30%), a Protein 
Threading (e.g., fold recognition) method was used to predict similarities in the folded 
protein structure of CFE polypeptides in the absence of a high level of sequence similarity 
with proteins in the databases (review by Teichmann, et al., 1999 Current Opinion in 
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Structural Biology 9:390-399). The Protein Threading method predicts the compatibility of 
a query sequence (e.g., CFE polypeptide sequences) with each of the folds in a library of 
known protein structures. The library of known protein structures as developed, maintained, 
and updated throughout the search process. 

A list of potential structural folds, onto which each query was compatible, was generated for 
all CFE polypeptides (e.g, SEQ ID NOS: 114-226). The fold assignments for each query 
were used to generate pairwise sequence alignments. The pairwise sequence alignments 
were used to generate protein models of the query polypeptide (e.g., CFE polypeptides). 

The pairwise sequence alignments were also used to compare the position of critical 
residues of the structural template with the query polypeptide. The list of critical residues 
was generated by using multiple sequence alignments derived from a structural 
classification of proteins to generate a conservation profile which provided sequence- 
specific positions conserved across a homologous family of protein folds. Comparative 
modeling was used to search the model of the query polypeptide for the critical residues and 
determine whether the structural and functional motifs are conserved in the query protein. 
Conservation of structural and functional motifs permitted assignment of putative structure 
and function to a query polypeptide sequence. 

The Protein Threading method was used to search for putative folded structure and function 
for all CFE polypeptides (SEQ ID NOS: 1 14-226). The CFE polypeptides having significant 
sequence identity (e.g., more than 30%) to known proteins were assigned putative functions 
with a high level of confidence. 

EXAMPLE 10 

The following provides a description of the methods used to characterize purified, CFE 
101 polypeptide. The 2CFE, 101 polypeptide mediates the conversion of pantothenate to 
4' phosphophantothenate, and is predicted to be a pantothenate kinase. 
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The computer-aided comparison, as described in Example 9 supra, suggests that the 
amino acid sequence of the CFE 101 polypeptide (SEQ ID NO:210) is 42% similar to the 
amino acid sequence of the coaA protein of E. coll Thus, CFE 101 may be a 
pantothenate kinase, which mediates the conversion of pantothenate to 4* 
phosphophantothenate (Figure 5). 

Circular Dichroism and Circular Dichroism Thermal Melt Analysis 

Circular dichroism and circular dichroism melt methods were used to determine the 
folded structure of the expressed and isolated 2CFE polypeptides. For example, this 
method was used to characterize the folded structure of isolated 2CFE 101 (SEQ ID 
NO:421). 

The starting concentration of the 2CFE 101 polypeptide was such that OD 2 os was 
approximately 1.5, and the OD 280 was approximately 0.05 (e.g., 0.05 to 0.1 mg/ml). The 
starting concentration of 2CFE 101 was approximately 344 pM in 50% glycerol, 50 mM 
Tris, 100 mM NaCl, 5 mM MgCl 2 , 0.5. mM EDTA, at pH 7.5. The polypeptide was 
diluted to a final concentration of 7 jaM, as determined by absorbance at A 28 o, in 20 mM 
Na-phosphate, 100 mM KC1, at pH 7.0. The circular dichroism analysis was performed 
using quartz cuvettes, the instrumentation was from JASCO (Model J-720), the readings 
were performed at 25 degrees C (Figure 6 A). The band width was 1 nm, the sensitivity 
was 20 mdeg, the response was 0.25 seconds, the scan speed was 50 nm/minute, and the 
step was 0.5. The circular dichroism thermal melt analysis was performed at a range of 
between 0 and 100 degrees C (Figure 6 B). Additionally, the circular dichroism was 
performed comparing monomer and aggregate pools of 2CFE 101. 
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Size Exclusion Analyses 

Size exclusion chromatography methods were performed using the Biosil SEC column, 
as described in Example 8 supra. The results suggest that the 2CFE 101 polypeptide 
5 forms monomer (40,200 Da) and oligomers (194,000 Da). The specific activity of the 
monomer and oligomeric forms of 2CFE 101 were determined, as described below. 

Biochemical Assays 

10 The biochemical assays of the 2CFE 101 polypeptide was based on the PK/LDH coupled 
enzyme assays described by Vallari, D. S., et al. (1987 J. Biol Chem. 262:2468-2471) 
and Song, W. -J., et al., (1994 J. Biol Chem. 269:27051-27058). 

Briefly, the assay was performed as follows. The reaction included: 885 jxl of 0.1 M 
15 Tris-HCl (pH 7.6), 25 \x\ NADH (14.1 mM), 20 yX ATP (10.7 mM), 50 \i\ phospho-enol- 
pyruvate (56 mM), 5 jil LDH/PK (lactose dehydrogenase/PK; Sigma, catalog # P-0294, 
60 U/ ml PK, 1050 U/ml LDH), 5 pi of the 2CFE 101 polypeptide (9 mg/ml in 50 mM 
Tris-HCl, pH 7.5, 100 mM NaCl which was diluted to 4.5 mg/ml in 50% glycerol). The 
reaction was started by adding 10 pi pantothenate (100 mM; Sigma, catalog # P2250). 
20 The production of ADP in the reaction was monitored by measuring the absorbance a 340 
nm. The results in Figure 8 show that the. 2CFE 101 polypeptide mediates ADP 
production in the presence of pantothenate and ATP. The K m of pantothenate (n=4) was 
144 (±16.5) pM, the V max of the 2CFE 101 polypeptide (n=4) was 2.04 (±0.25) \M min 1 
nig" 1 . The. monomer form has a specific activity of approximately 1.7 \iM min 1 mg" 1 . 
25 The oligomeric form has a specific activity of 0.26 jiM min" 1 mg" 1 . 

Alternatively, the 2CFE 101 polypeptide can be tested in an assay that monitors the 
conversion of pantothenate to 4'-phosphopantothenate. The same reaction described * 
above can be used, except H C-labeled pantothenate is used. The. reaction can be 
30 monitored by measuring the amount of 14 C-labeled 4'-phosphopantothanate produced. 
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EXAMPLE 11 

The following provides a description of the methods used to characterize purified, CFE 
39 and CFE 21 polypeptides, carrying a C-terminal histidine 6-tag. The methods include 
helicase reactions, in which synthetic Holliday Junction templates are resolved into 
duplex structures. In one method, helicase reaction was monitored using radiolabeled 
templates. In another method, the helicase assay was adapted for use in a high 
throughput assay employing fluorescence labeled templates. 

Computer-Aided Comparison 

The computer-aided comparison, as described in Example 9 supra, suggests that the CFE 
39 polypeptide (SEQ ID NO: 148) is an RuvA homologue. The comparison also 
suggests that CFE 21 (SEQ ID NO:132) is an RuvB homologue. 

Previous studies by Parsons and others have shown that RuvA and RuvB proteins, in E. 
coli, promote branch migration or movement of Holliday Junctions during genetic 
recombination and DNA repair (Parsons, C. A. 5 et al., 1992 Proc. Natl., Acad. Sci. USA 
89:5452-5456; Tsaneva, I. R., et al., 1993 Proc. Natl, Acad. Sci. USA 90:1315-1319; 
Muller, B., et al., 1993 J. Biol Chem. 268:17179-17184; Mitchell, A. H. and S. C. West 
1996 J. Biol- Chem. 271:19497-19502; Parsons, C. A. and S. C. West 1993 J. Molec. 
Biol 232:397-405; Tsaneva, L R., et al., 1992 Molec. Gen. Genet. 235:1-10; Mitchell, A. 
H. and S. C. West 1994 J. Molec. Biol 1994 243:208-215). 

Size Exclusion Chromatography 

Size exclusion chromatography was performed on 2CFE 39 (SEQ ID NO:366) and 2CFE 
21 (SEQ ID NO:350) using the Phenomenex 600 x 7.5 mm Biosep SECS 3000 column, 
as described in Example 8 supra. Protein standards (BioRad) were used to calibrate the 
column, including thyroglobulin (670,000 Da), gamma globulin (158,000 Da), ovalbumin 
(44,00 Da), myoglobin (17,00 Da), and B-12 (1350 Da). 
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The results indicate that 2CFE 39 (RuvA) forms tetramers and 2CFE 21 (RuvB) forms a 
hexameric ring structure. Selected eluted samples were electrophoresed on a 
. polyacrylamide gel (Novagen) (Figure 9). 

5 

The Hollidav Junction Analysis Using Radiolabeled Templates 

The Holliday Junction analysis was performed using radiolabeled, synthetic, 
asymmetrical, Holliday Junction templates, as described in Hiom, K. and S. C. West 
10 1995 Cell 80:787-793. The Holliday Junction templates were produced by annealing 
together four separate, single-stranded, oligonucleotide strands to form four-stranded 
structures (e.g., the Holliday Junction template). The Holliday Junction templates were 
reacted with the 2CFE 39 and 2CFE 21 polypeptides, in a helicase reaction, to test their 
ability to generate two duplex structures. 

15 

Producing the Synthetic Hollidav Junction Templates 

* The asymmetrical Holliday Junction templates were produced by annealing the following 
oligonucleotide sequences: 

20 

Oligonucleotide strand 1: 

5'-CCAGTGATCACATACGCTTTGCTAGGACATCTTGATATCAGCCCACGTT 
CACCCGCCTACCAGTGCCACGTTGTATGCCCACGTTGACC-3' (SEQ ID NO:438) 

25 Oligonucleotide strand 2: 

5 5 -GGGTCAACGTGGGCATACAACGTGGCACTGGTAGGCGGGTGAACGTGGG 
CTGATATCAAGATGTCCATCTGTCCGTtCATCTATGACGT-3' (SEQ ID NO:439) 

Oligonucleotide strand 3: 
30 5'-AACGTCATAGATGAACGGACAGATCATGGTGCTTTTAAAGTCTAGAGAC 

TATCGAGCATTAGTACCAGTATCGAATCCGTCTTGTCAA-3 , (SEQ ID NO:440) 
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Oligonucleotide strand 4: 

5 ' -TTTG AC AAGACGGATTCGATACTGGTACTAATGCTCGATAGTCTCTAGAC 
TTTAAAAGC ACC ATGTAGC AAAGCGTATGTGATCACTG-3 * (SEQ ID NO:441) 

5 

Oligonucleotide strand 3 was labeled at the 5* end using approximately 300 ng of 
oligonucleotide strand 3, 1 \xl lOx Phosphate Buffer, 5 jal 32 P ATP, 1 fJ T4 polynuclotide 
kinase (Gibco-BRL)), in a 10 \xl volume, and the reaction was performed at 37 degrees C 
for 30 minutes. The reaction was loaded onto a G50 column to remove the 
10 unincorporated radiolabel. The final concentration of the radiolabeled oligonucleotide 
strand 3 was approximately 15 ng per fil. 

Approximately equimolar amounts of the four oligonucleotide strands were annealed 
(e ; g., hybridized). The annealing reaction included: 5 jal Annealing Buffer (200 mM 

15 Tris-Cl pH 8.0, 100 mM MgCl 2 , 1 M NaCl, 10 mM DTT); 450 ng of radiolabeled 
oligonucleotide strand 3; and 1000 ng each of oligonucleotide strands 1, 2, and 4; in 50 \d 
total reaction volume. The control annealing reaction included: 5 ^1 Annealing Buffer, 
60 ng radiolabeled oligonucleotide strand 3; 1000 ng oligonucleotide strand 4; in 50 
total reaction volume. Annealing was performed at 95 degrees C for 5 minutes, 65 

20 degrees C for 30 minutes, 42 degrees C for 30 minutes, and room temperature (e.g., 
between about 23 to 27 degrees C) for 30 minutes to generate the synthetic Holliday 
Junction templates. The synthetic Holliday Junction templates were gel or column- 
purified to remove the duplex and non-annealed products. As a control, oligonucleotide 
strands 3 and 4 were annealed to form duplex structures. The synthetic Holliday Junction 

25 templates and duplex structures were stored at -20 degrees C. 

CFE 39 and CFE 21 : The Helicase Reaction Using Radiolabeled Templates 

The helicase reaction was performed to determine whether 2CFE 39 and 2CFE 21 
30 resolved the synthetic Holliday Junction templates into duplex structures. The helicase 
reaction was performed as follows. A 50 pi total reaction volume included: 25 |il of 2x 
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Reaction Buffer (50 mM Tris-Cl pH8.0, 30 mM MgCl 2 , 2 mM ATP); 1 yl synthetic 
Holliday Junction template (36 ng); 2 nl 2CFE 39 (1 |aM); and 2 yd 2CFE 21 (1 fiM). 
The reaction was incubated at 37 degrees for 30 minutes. The reaction was stopped by 
adding 5 )il Stop Buffer (100 mM Tris-Cl pH 7.5, 5 mg/ml Proteinase-K, 5% SDS). The 
5 stopped reaction was returned to 37 degrees C for 5 minutes. The helicase reaction was 
loaded onto and run on a non-denaturing, 12% PAGE, Tris-glycine gel. 

The results shown in Figure 10, lanes 6, 7 and 8, indicate that the 2CFE 39 and 2CFE 21 
polypeptides resolved the synthetic Holliday Junction templates into duplex structures. 

10 

CFE 39: The Helicase Reaction 

It has been previously shown that E. coli RuvA binds to Holliday Junction templates 
(Parsons, C. A., et aL, 1992 Proc. Natl, Acad Sci USA 89:5452-5456). The ability of & 

15 pneumoniae CFE 39 to bind to a Holliday Junction template can be tested by employing 
the helicase assay described herein. The results of the helicase assay can be monitored by 
performing a gel shift assay and/or capillary electrophoresis. The presence of a Holliday 
Junction template bound to 2CFE 39, which migrates more slowly than the Holliday 
Junction template alone, would indicate that S. pneumoniae 2CFE 39 binds to Holliday 

20 Junction templates. 

CFE 39 and CFE 21 : Hollidav Junction Analysis Using Fluorescen t-Labeled Templates 

The helicase reaction described herein was performed using Holliday Junction templates 
25 having one oligonucleotide strand labeled with a fluorescent agent and another strand 
labeled with a quenching agent. The 5 5 fluorescent end and the 3' quenching end of the 
strands that make up the Holliday Junction templates are in proximity to each other, 
resulting in a non-fluorescent template. When the Holliday Junction templates are 
resolved into duplex structures, the fluorescent and quench ends are not in proximity to 
30 each other, resulting in fluorescence. 
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The Holliday Junction templates used to perform this experiment comprised the 
following: the 5' end of oligonucleotide strand 1 was labeled with a fluorescein (e.g., the 
fluorescent agent), and the 3* end of oligonucleotide strand 4 was labeled with DABCYL 
(e.g., the quenching agent). The oligonucleotide strand 1 labeled with fluorescein and the 
5 oligonucleotide strand 4 labeled with DABCYL were custom synthesized (Gibco-BRL 
Life Technologies, Inc.). 

The fluorescein and DABCYL labled oligonucleotides were annealed in a reaction, as 
described above, to generate synthetic Holliday Junction templates. The helicase reaction 
10 was performed as described above. The results of the helicase reaction were monitored 
by measuring the unquenching of the Holliday Junction templates with time (Figure 11). 

The helicase assay using Holliday Junction templates labeled with fluorescent-quenching 
agents can be adapted for use in high throughput analyses to test 2CFE 39, 2CFE 21, and 
1 5 other polypeptides for their ability to resolve the templates into duplex structures. 

EXAMPLE 12 

The following provides a description of the methods used to characterize purified, CFE 8 
20 polypeptide, which lacks a histidine tag. The CFE 8 is a putative DNA single-stranded 
binding protein. 

Computer- Aided Comparison 

25 The computer-aided comparison, as described in Example 9 supra, suggests that the CFE 
8 polypeptide (SEQ ID NO:121) may be a single stand binding protein homologue, such 
asSSB. 
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Size Exclusion Chromatography 

The 2CFE 8 polypeptide (SEQ ID NO:339) was characterized by size exclusion 
chromatography, using the Biosil SEC-125 HPLC Gel Filtration column as described in 
5 Example 8 supra. The chromatogram showed one peak corresponding to a molecular 
weight of approximately 89 kDa. Based on the nucleotide sequence, the predicted 
molecular weight of 2CFE 8 is 17,351 Da. In non-denaturing conditions, 2CFE 8 forms a 
multimer. 

10 Binding Reaction 

The 2CFE 8 polypeptide was reacted with a single-stranded oligonucleotide A. Briefly, 
the binding reaction included: 50 |iM of 2CFE 8 polypeptide, 50 jiM oligo strand A, 20 
mM Tris/20 mM KC1 pH 7.5. The binding reaction was performed at 37 degrees C, for 2 
15 hours. 

Oligonucleotide strand A: 

5'-TTAGGGCCCGGGCTATCTTACAATCTCGTT-3 ' (SEQ ID NO:442) 
20 Capillary Electrophoresis 

The results of the binding reaction was monitored by capillary electrophoresis, following 
the methods described in "Handbook of Capillary Electrophoresis" 2 nd Edition, 1997, ed. 
J. Landers. 

25 

Separation was performed using an uncoated capillary tube (360 jim o.d., 50 \\m i.d., 
with a 50 cm effective separation length; Watrex International, Inc., Pittsford, NY) and 
50 mM borate pH 9.3 as the mobile phase, at 25 kVolts, 20 minutes separation time. 

30 The results indicate that 2CFE 8 alone elutes as a sharp peak, indicating little adsorption 
to the uncoated capillary wall (Figure 12 A). The shape of the peak and peak retention 
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time changed with 2CFE 8 in the presence of all oligonucleotides tested (Figure 12 B). 
As a negative control, MurB polypeptide (Pucci, M. J., L. F. Discotto, and T. J. 
Dougherty 1992 "Cloning and Identification of the Escherichia coli murB DNA 
sequence, which encodes UDP-N-acetylenoIpyruvoylglucosamine reductase" J. 
5 Bacterial 174:1690-1693) was reacted with the same oligonucleotides. MurB reacted 
with or with out the oligonucleotides showed no change in peak shape or retention time. 

After capillary electrophoresis analyses, the 2CFE8 alone and 2CFE plus oligonucleotide 
samples were run on native polyacrylamide gels to determine whether the polypeptide 
1 0 was intact. The results indicate that in all cases, 2CFE 8 was intact and had not degraded 
with time or storage. 

Mobility Shift Assays 

15 The ability of 2CFE 8 polypeptide to bind oligonucleotide strand A was tested in a 
mobility shift assay. 

The results indicate that 2CFE 8 binds single stranded oligonucleotides (Figure 13 A and 
B). In Figure 13 A, the gel was stained with ethidium bromide. The unbound 
20 oligonucleotides appear near the bottom of the gel, while the bound oligonucleotides 
appear near the middle. The same gel was stained with Coomassie (Figure 13 B), 
revealing that 2CFE 8 polypeptide bound -to the oligonucleotide migrated further than 
unbound 2CFE 8, due to the change in charge carried by the oligonucleotide. Various 
ratios of 2CFE8:oligo were tested. The optimal binding ratio was 2:1 . 

25 

The Effect of MgCk 

The 2CFE 8 polypeptide precipitated in the presence of 5 mM MgCh. The precipitation 
was reversible by the addition of 1 jiM of the oligonucleotides tested. The observation 
30 indicates specific binding between 2CFE 8 polypeptide and the oligonucleotides tested. 



92 



WO 01/49721 

Scintillation Proximity Assay 



PCT/US00/35604 



Scintillation proximity, assay (SPA) methods can be used in a high throughput screening 
procedure to monitor, for example, a binding reaction. SPA utilizes beads (Amersham) 
5 which are coated on the surface with a particular compound or molecule. For example, 
the SPA bead may be coated with avidin to facilitate binding with any molecule having a 
biotin tag. 

The binding reaction of the 2CFE 8 polypeptide and the oligonucleotide strand A can be 
10 monitored using SPA beads and a scintillation counter. The beads can be coated with 
avidin, the 2CFE 8 polypeptide can be tagged with biotin, and the oligonucleotide strand 
A can be radiolabeled. 

EXAMPLE 13 

15 

The following provides a description of the methods used to characterize purified, 2CFE 
3 (SEQ ID NO:334) and 2CFE 86 (SEQ ID NO:409) polypeptides. 

The 2CFE 3 polypeptide catalyzes the conversion of D-glucosamine-6-phosphate to D- 
20 glucosamine- 1 -phosphate, indicating that 2CFE 3 mediates amino-sugar biosynthesis 
through the N-acetyl glucosamine pathway (Figure 14). 

The 2CFE 86 polypeptide catalyzes the conversion of D-glucosamine-1 -phosphate to N- 
acetylglucosamine-1 -phosphate, and the conversion of N-acetylglucosamine-1 -phosphate 
25 to UDP-N-acetylglucosamine-1 -phosphate, which indicates that 2CFE 86 also mediates 
amino-sugar biosynthesis through the N-acetyl glucosamine pathway (Figure 14). 

Computer- Aided Comparisons Of CFE 3 

30 The computer-aided comparison, as described in Example 9 supra, suggested that the 
CFE 3 polypeptide (SEQ ID NO: 11 6) is a phosphoglucosamine mutase, such as GlmM. 
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Purification of the CFE 3 Polypeptide 

The 2CFE 3 polypeptide was produced using the large scale IPTG-induced method 
described in Example 5, supra. The 2CFE 3 polypeptide lacks a C-terminal histidine tag. 
The 2CFE 3 polypeptide was purified using a 2-column procedure. The 2CFE 3 
polypeptide preparation was eluted from a 26/10 Q Sepharose column (Pharmacia) using 
a 0-1.0 M NaCl gradient, 2 ml/minute flow rate, and the gradient size was 1 liter. Then 
the 2CFE 3 polypeptide was eluted from a hydroxyapatite Bio-gel column (Bio-Rad) 
using a 5-200 mM potassium phosphate (pH 8.0) gradient, the flow rate was 0.3 
ml/minute, and the gradient size was 300 ml. A sample of the 2CFE 3 preparation was 
electrophoresed on an SDS polyacrylamide gel (Figure 4). 

Affinity Capillary Electrophoresis of CFE 3 

Affinity capillary electrophoresis methods were used to determine whether the 2CFE 3 
polypeptide binds to various glucose derivatives. Binding was performed under 
equilibrium conditions, in which the sugars were dissolved in the running buffer and 
reacts with 2CFE 3 during separation in the column. The affinity capillary 
electrophoresis method used to analyze 2CFE 3 follows the methods described in 
"Handbook of Capillary Electrophoresis" 2 nd Edition, 1997, ed. J. Landers. 

Briefly, 2CFE 3 polypeptide was reacted with increasing amounts of various glucose 
derivatives (e.g., substrate) at 25, 30 and 37 degrees C. The glucose derivatives included 
UDP-glucose, glucose- 1 -phosphate, glucose-6-phosphate, glucosamine- 1 -phosphate, and 
glucosamine-6-phosphate. The reaction included: 2CFE 3 polypeptide (2.0 mg/ml), 
separation buffer (25 mM Tris; 192 mM Glycine, pH 8.0; BupH Tris-Glycine Buffer 
Packs, Pierce). Separation was performed at 25 kVolts, separation time was 15 or 20 
minutes. 

The results shown in Figure 15 A indicate that at 25 degrees C, 2CFE 3 binds to D- 
glucose- 1 -phosphate in a dose-dependent manner, as the peak shape and/or the retention 
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time for 2CFE 3 changes in the presence of 100 and 500 j^M D-glucose-1 -phosphate 
compared to unreacted 2CFE 3. 



The results shown in Figure 15 B indicate that at 25 degrees C, 2CFE 3 binds to D- 
glucosamine-6-phosphate in a dose-dependent manner, as the peak shape and/or the 
retention time for 2CFE 3 changes in the presence of 100 and 500 |iM D-glucosamine-6- 
phosphate compared to unreacted 2CFE 3. . 

The results shown in Figure 15 C indicate that at 25 degrees C, the 2CFE 3 polypeptide 
also binds to glucose-6-phosphate. 

A comparison of 2CFE 3 reacted with various glucose derivatives, at 30 degrees C, is 
shown in Figure 1 5 D. The results indicate that D-glucosamine-6-phosphate is a putative 
substrate for 2CFE 3, as this reaction exhibits the greatest change in peak shape and/or 
retention time. 

CFE 3: Capillary Electrophoresis and Laser-Induced Fluorescence 

In a further analysis of 2CFE 3 polypeptide, capillary electrophoresis was performed with 
laser-induced fluorescence in order to separate and detect interaction between the 
substrate (e.g., D-glucosamine-6-phosphate) and the product (e.g., D-glucosamine-1- 
phosphate) in a one dose, one time-point procedure. 

The 2CFE 3 polypeptide was derivitized by reacting 10 mM FITC (fluorescein 
isothiocyanate dissolved in methanol; Calbiochem, San Diego, CA) with D-glucosamine- 
6-phosphate, at ambient temperature, in the dark, overnight. The FlTC-derivatized 2CFE 
3 polypeptide (2.0 mg/ml) was reacted with the substrate (D-glucosamine-6-phosphate 
and D-glucosamine- 1 -phosphate) for one hour. 

Separation was performed using an uncoated capillary (360 pm o.d., 50 |om i.d., with a 
50 cm effective separation length) and 50 mM borate (pH 9.3) as the mobile phase. The 
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argon-ion laser had an excitation wavelength of 488 nm and an emission filter of 520 nm 
(Beckman, Fullerton, CA). The results shown in Figure 16 indicate that 2CFE 3 binds 
and catalyzes the conversion of D-glucosamine-6-phosphate to D-glucosamine-1- 
phosphate. 

5 

Computer-Aided Comparison Of CFE 86 

The comparison results, as described in Example 9 supra, suggested that the CFE 86 
polypeptide (SEQ ID NO: 195) is an acetyltransferase, such as GlmU which is a 

1 0 Afunctional enzyme in E. coli. It has been previously shown that, in E coli, GlmU is a 
Afunctional protein having both the acetyltransferase and uridylyltransferase active sites 
(Mengin-Lecreulx, D. and J. van Heijennort 1994 J. Bacteriol 176:5788-5795; Gehring, 
AL, et al., 1996 Biochemistry 35:579-585). The bifimctional enzyme catalyzes the 
conversion of D-glucosamine-1 -phosphate to N-acetylglucosamine-1 -phosphate 

15 (acetyltransferase), and catalyzes the conversion of N-acetylglucosamine-1 -phosphate to 
UDP-N-acetylglucosmine-l -phosphate (uridylyltransferase). The Km of the 
acetyltransferase and uridylyltransferase reactions has been previously calculated 
(Mengin-Lecreulx, D. and J. van Heijennort 1994 supra ). Additionally, the crystal 
structure of GlmU from E. coli is known (Brown, K., et aL, 1999 EMBO J. 18:4096- 

20 4107). 

Purification of the CFE 86 Polypeptide 

The 2CFE 86 polypeptide (SEQ ID NO:409) has a C-terminal histidine tag. The 2CFE 
25 86 polypeptide was produced using the large scale IPTG-induced method described in 
Example 5, supra. The 2CFE 86 polypeptide was purified using the Ni-NTA affinity 
column method described in Example 6, supra. The eluted 2CFE 86 polypeptide was 
dialyzed against 50 mM Tris-Cl, 100 mM NaCl, 25% glycerol, pH 8.0. Samples of the 
purified 2CFE 86 polypeptide were electrophoresed on a polyacrylamide gel (Figure 17). 

30 
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A biochemical assay was performed, to determine whether 2CFE 3 and 2CFE 86 convert 
D-glucosamine-6-phosphate to UDP-N-acetylglucosaxnine-1 -phosphate (e.g., UDPAG). 
5 The 2CFE 3 and 2CFE 86 polypeptides were used in a coupled reaction based on the 
assays described in Jolly, L. P., et al., 1999 Eur. J. Biochem. 262:202-210. 

A time-dependent and dose-dependent assay were performed. Briefly, the assay was 
performed in 96- well plates, each well including 100 \x\ volume. The assay included: 1 

10 mM D-glucosamine-6-phosphate (Sigma); 0.7 mM D-glucosamine-l,6-diphosphate 
(Sigma); 1.2 mM acetyl-Coenzyme A (Sigma); and 5 mM uridine-5' -phosphate (Sigma); 
3 mM MgCl 2 (Sigma); 50 mM Tris-Cl, pH 8.0 (Life Technologies). The reaction was 
started by adding 1 \ig of 2CFE 3; and 10 ng of 2CFE 86. The reaction was performed at 
room temperature. The reaction was stopped at 0 5 15, 30, and 65 minutes, by filtering out 

15 the 2CFE polypeptides. 

The results of the assay was monitored by HPLC (high pressure liquid chromatography) 
using an Optisil 10p. SAX column (250 x 4.6 mm), measuring at 262 nm, the mobile 
phase was 150 mM KH 2 P0 4 (pH 3.5), and 1.5 ml/minute flow rate. The results shown in 
20 Figure 18 show the time-dependent assay and indicate that HPLC detected the presence 
of UDPAG. 

CFE 86: The Uridvlvltransferase Reaction 

25 The 2CFE 86 polypeptide was tested in a uridylyltransferase reaction, in which N-acetyl- 
D-glucosamine-l-phosphate and UTP produce UDP-N-acetylglucosamine. The 
uridylyltransferase reaction was monitored using a malachite green/inorganic 
pyrophosphatase assay (e.g., malachite green-IPPAse assay) and/or monitored using 
HPLC. The malachite green-IPPAse assay was used to measure orthophosphate 

30 production from digestion of the pyrophosphate liberated in the uridylyltransferase 
reaction. 
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The malachite green reagent was prepared as follows. A 0.045 % solution of malachite 
green (Sigma; M9636) was prepared in water. A 4.2 % solution of ammonium 
molybdate (Mallinckrodt) was prepared in 4N HC1. The malachite green and ammonium 
5 molybdate were mixed in a 3:1 ratio, and stirred for about 20 minutes. The mixture was 
filtered, and stored at 4 degrees C. The inorganic pyrophosphatase (Sigma; 1-2267) was . 
diluted to 0.1 U/p.1 in 50 mM Tris/3mM MgCl 2 ph 8.0, and stored at 4 degrees C. 

The*uridylyltransferase reaction was performed in 96-well plates, The coupled reaction 
10 described herein was performed, in the presence of 2CFE 3 alone or 2CFE 3 and 2CFE 
86, and included the addition of 0.5 U/well of the diluted inorganic pyrophosphate. The 
reaction was mixed for 5 minutes at room temperature. The reaction was stopped by the 
addition of 240 |il/well of the malachite green reagent and 30 jil/well of 34% sodium 
citrate, and the reaction was mixed. Hie results of the uridylyltransferase reaction was 
1 5 monitored by spectrophotometry at 660 nm. 

The results of separate uridylyltransferase reactions were monitored by HPLC, using a 
Phenosphere-NEXT CI 8 column (250 x 4.6 mm). The mobile phases included A and B 
as follows: A) methanol/10 mM potassium phosphate pH 6.5 (0:100); and B) 
20 methanol/10 mM potassium phosphate pH 6.5 (40:60). The mobile phases were run 
under the following conditions: 100% mobile phase A for 5 minutes,' to 100% mobile 
phase B in 3 "minutes; and hold 100% mobile phase B for 9 minutes. The retention time 
for the UDPAG product is approximately 5.75 to 6.0 minutes. 

25 The results three uridylyltransferase reactions, monitored by HPLC are summarized in 
Table HI below. 
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TABLE HI 



Purified CFE 86; 

2CFE 86-1 
2CFE 86-2 
2 CFE 86-3 



Specific Activity 
(nmol/min/ug): 

3.1 
3.4 
3.1 



The results of the uridylyltransferase reactions, monitored by HPLC or HPLC and 
Malachite Green IPPAse assays are summarized in Table IV below. 

TABLE IV 



Reaction; 

Acetvltransferase reaction: 

Glucosamine- 1-P 
Acetyl-coA 

Uridvlvtransferase reaction: 

N-acetylglucosamine- 1 -P 
UTP 



Km fuM); 



94 
150 



48 
79 



Method: 



HPLC 
HPLC 



HPLC and MG/IPPAse 
HPLC 



EXAMPLE 14 



The following provides a description of the methods used to characterize various 2CFE 
polypeptides, including CFE 21, 34, 35, 39, and 90. The molecular weight of these 2CFE 
polypeptides were analyzed by size exclusion chromatography and gel electrophoresis. 
The 2CFE 34, 35, and 90 polypeptides putatively mediate fatty acid biosynthesis. 
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The computer-aided comparison, as described in Example 9 supra, suggests that CFE 34 
(SEQ ID NO:143), CFE 35 (SEQ ID NO: 144), and 90 (SEQ ID NO: 199) are 
5 polypeptides which mediate a fatty acid biosynthesis pathway (Figure 1 9) 

The comparison suggests that CFE 34 is a malonyl CoA:ACP transcylase, which 
catalyzes the reaction in which malonyl CoA and acyl carrier protein (ACP) are 
converted to malonyl-ACP and CoA. Thus, the CFE 34 polypeptide may be a homologue 
10 of £ co// FabD. 

The comparison suggests that CFE 90 is a 3-oxoacyI-ACP synthase II (beta ketoacyl- 
ACP synthase H) which catalyzes the reaction in which malonyl-ACP is converted to 
beta aceto acetyl-ACP. Thus, the CFE 90 polypeptide may be a homologue of E. coli 
15 FabF. 

The comparison suggests that CFE 35 is a 3-oxoacyl-ACP reductase (beta aceto acetyl 
ACP reductase) which catalyzes the reaction in which beta-keto-acetyl-ACP is converted 
to beta-hydroxy-acetyl-ACP. Thus, the CFE 35 polypeptide may be a homologue of R 
20 coli FabG. 

Size Exclusion Chromatography 

The estimated molecular' weights of 2CFE 34 (SEQ ID NO:361), 2CFE 35 (SEQ ID 
25 NO:362), and 2CFE 90 (SEQ ID NO:413) were determined using the Biosil SEC-125 
HPLC Gel Filtration column as described in Example 8, supra. 

The results suggest that 2CFE 34 polypeptide is a monomeric protein (33,093 Da), 2CFE 
35 is a trimeric protein (25,758 Da; approximately 85%), and 2CFE 90 is a dimeric 
30 protein (43,930 Da). Selected eluted samples of 2CFE 34 were electrophoresed on a 
polyacrylamide gel (Figure 20). 
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Biochemical Assay: CFE 34 

The function of 2CFE 34 was determined by performing various biochemical reactions. 
5 To determine whether 2CFE 34 catalyzes the convertion of malonyl-CoA to malonyl and 
CoA, the following reaction was performed. 

The biochemical reaction was performed in the presence of acyl carrier protein. The 
reaction included the following: 10 yM 14 C labeled malonyl-CoA, 20 nM ACP, 30 jaM 

10 2CFE 34 (e.g., FabD) in 20 mM Tris-Cl, pH 8.0 and 5 mM DTT in 300 yd volume. The 
reaction was performed at room temperature (e.g., approximately 24 degrees C) for 30 
minutes. The reaction was terminated with the addition of 45^1 of 0.5% TFA. The 
labeled reaction was injected onto a MonoQ 5/5 column on a Gilson HPLC. Detection 
was performed by monitoring the radioactivity of the continuous flow-through of the 

15 HPLC effluent. Chromatography was performed using a buffer gradient for column 
elution. Buffer A included 20 mM Tris-Cl, pH 8.3. Buffer B was the same as Buffer A 
and included 1 M NaCl. The program was held at 90% A, 10% B for 10 minutes 
followed by a linear ramp to a final mix of 50% of each Buffer A and B over 10 minutes. 

20 The substrate (e.g., I4 C malonyl-CoA) eluted at 9.9 minutes, the product (e.g., 14 C 
malonyl-ACP) eluted at 14.3 minutes. The results indicate that CFE 34 catalyzes the 
conversion of malonyl-CoA and acyl carrier protein (ACP) to malonyl-ACP and CoA. 

EXAMPLE IS 

25 

The following provides a description of the methods used to characterize CFE 
polypeptides 40, 41, and 46. 
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The computer-aided comparison, as described in Example 9 supra, suggests that the CFE 
40 polypeptide (SEQ ID NO: 149) is a phosphomethylpyrimidine (HMP-P) kinase 
5 involved in thiamine biosynthesis. 

• The comparison, as described in Example 9 supra, suggests that the CFE 4.1 polypeptide 
(SEQ ID NO: 1 50) has a GTP-binding motif and may be a protease. 

10 The comparison, as described in Example 9 supra, suggests that the CFE 46 polypeptide 
(SEQ ID NO: 155) has an ATP-binding motif. 

Affinity Purification of CFE 41 

15 The large-scale method described in Example 5 supra (e.g., IPTG-induced protein 
production) was used to prepare a sample of 2CFE 41 polypeptide (SEQ ID N0.368). 
The sample was affinity purified using the Ni-NTA method described in Example 6, 
supra. The eluted fractions were loaded onto and run on a 12% SDS-PAGE gel (Novex) 
(Figure 21). 

20 

Circular Dichroi sm and Circular Dichroism Thermal Melt Analysis 

Circular dichroism and circular dichroism thermal melt methods were performed using 
JASCO instrumentation. The concentration of the isolated 2CFE 40 (SEQ ID NO:367) 
25 was approximately 21 \M, in a 0.1 cm pathlength cell at 210 nm. The circular 
dichroism spectrum suggests that this preparation of 2CFE 40 had mixed alpha and beta 
secondary structure. The circular dichroism thermal melt spectrum suggests that 2CFE 
40 has a T m of approximately 67 degrees C, The 2CFE 40 polypeptide precipitates at 
approximately the T m . 
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The concentration of the isolated 2CFE 41 (SEQ ID NO:368) was approximately 70 nM, 
in a 0.02 cm pathlength cell. The circular dichroism spectrum suggests that this 
preparation of 2CFE 41 had mixed alpha and beta secondary structure, with a greater 
5 percentage of alpha structures. The circular dichroism thermal melt spectrum suggests 
that 2CFE 41 has a T m of approximately 38 degrees C. The 2CFE 41 polypeptide 
precipitates at approximately the T m . 

The concentration of the isolated 2CFE 46 (SEQ ID NO:373) was approximately 23 |aM, 
10 in a 0. 1 cm pathlength cell at 280 nm. The circular dichroism spectrum suggests that this 
preparation of 2CFE 46 had mixed alpha and beta secondary structure. The circular 
dichroism thermal melt spectrum suggests that 2CFE 46 is highly stable at elevated 
temperatures. At 90 degrees C, the 2CFE 46 polypeptide exhibited only a 27% loss in 
signal and the polypeptide remained soluble. 

15 

Capillary Electrophoresis 

Capillary electrophoresis was performed on samples of purified 2CFE 40, 41 and 46. 
The electropherograms of 2CFE 40, 41, and 46 are shown in Figure 22. 

20 

EXAMPLE 16 

The following provides a description of methods that can be used to characterize CEG 
polypeptides (e.g., CFE polypeptides). 

25 

Computer- Aided Compilation 

Computer-aided compilation of bacterial metabolic pathways may be analyzed using 
Pathway Tools from Doubletwist, based on the EcoCyc system (Karp P.D., et al, 1999 
30 Nucleic Acids Res. 1 999 27(l):55-58). This analysis may be used to predict which CFEs 
mediate various steps of the pathways. This information may be used in combination 
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with the results of a binding reaction which identifies a ligand or substrate that binds with 
a CFE polypeptide. 



Identifying the Function of a CFE Polypeptide 

The function of a CFE polypeptide may be identified by identifying a ligand or substrate 
which binds with the CFE polypeptide. The ligand or substrate may be identified using 
fractionation and affinity capillary electrophoresis methods. The following method is 
based upon the assumption that the bacterial cell lysate includes the ligand or substrate. 

A bacterial host cells carrying an endogenous (e.g. native) CFE gene or carrying a 
recombinant vector which includes a CFE gene may be cultured so that the CFE 
polypeptide is produced by the cell. The cells may be ruptured in order to obtain the cell 
lysate. The cell lysate may be fractionated using HPLC technology. The HPLC fractions 
may be reacted with a CFE polypeptide in a binding reaction, and the binding reaction 
may be analyzed by affinity capillary electrophoresis methods. The ligand or substrate 
which reacts with the CFE polypeptide may be identified using mass spectrophotometry 
methods (in "Mass Spectrometry" 1990 eds. McCloskey, J. A., in Methods in 
Enzymology volume 193; Henion, J., et al., 1993 "Mass Spectrometric Investigations of 
Drug-Receptor Interactions" Ther. Drug Monit. 15:563-569; Loo, J. A., et al., 1999 
"Application of Mass Spectrometry for Target Identification and Characterization" Med. 
Res, Rev. 19:307-319; Nguyen, D. N., et al., 1995 "Protein Mass Spectometry: 
Applications to Analytical Biotechnology J. ChromatogrJ05:2l-45). 

EXAMPLE 17 

The following provides a description of nuclear magnetic resonance (NMR) spectroscopy 
methods that were used to characterize CFE polypeptides. 

High resolution NMR spectroscopy was applied to 15 N-labled, ,3 C/ 15 N-labeled, 
2 H/ 13 C/ I5 N-labeled, and type-specifically isotopically labeled CFE polypeptide samples 
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in the solution state for the following purposes: to assess various aspects of the structural 
state, e.g., foldedness, structural integrity; to refine a previously determined experimental 
structure of a close sequence homologue; to refine a homology-modeled structure; to 
assess the potential for a CFE polypeptide to bind small molecules; and to identify small- 
5 molecule pharmacophore fragments that bind specifically to the CFE polypeptide 
(VNuclear Magnetic Resonance" 1994 eds. James, T. L. in Methods in Enzymology 
volume 239). 

The NMR analysis includes screening both a compound deck of approximately 4,500 
10 commercially available, structurally and chemically diverse compounds (the small- 
molecule pharmacophore deck) and a compound deck of proprietary, known, anti- 
microbial compounds (anti-microbial deck) against the CFE polypeptides (i.e., target 
polypeptides) to determine, either based upon perturbations to the chemical shifts of the 
amide proton and/or nitrogen resonances, as measured from a two-dimensional proton- 
15 nitrogen heteronuclear single-quantum correlation spectrum (2D screening method), or 
based upon increases in the linewidth of the compound's proton resonance(s), as 
measured by a one-dimensional T )p spin-lock difference spectrum (ID screening 
method), both whether a compound binds to a CFE polypeptide and, in the case of the 2D 
screening method, where the compound binds on the CFE polypeptide. 



20 



Isotopic Labeling of CFE Polypeptides 

BL21-DE3 E. coli bacteria are transformed with the CFE expression vectors. Expression 
takes place between 20°C and 37°C in mmimal media containing [ I5 N]-ammonium 
sulfate as the sole nitrogen source and either glucose, [ 2 H]i 3 -glucose, or [ 13 C] 6 -glucose as 
the sole carbon source. Glucose is used for.preparing uniformly 15 N-labeled and 2 H/ 15 N- 
labeled CFE polypeptides. [ 2 H] ]3 -glucose is used for preparing type-specifically 'H/ I3 C- 
labeled, uniformly 15 N-labeled CFE polypeptides. [ I3 C] 6 -glucose is used for preparing 
13 C/ 15 N-labeled CFE polypeptides. The minimal media is prepared in 100% H 2 0 for 
expressing both uniformly 15 N-Iabeled and uniformly 13 C/ 15 N-labeled CFE polypeptides; 
the minimal media is prepared in 95% D 2 0 (deuterium oxide) and 5% H 2 0 for expressing 
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both type-specifically ^"C-labeled, uniformly 1 ^-labeled, and just uniformly 2 H/ 15 N- 
labeled CFE polypeptides. In the case of type-specifically 'H/^C-labeled, uniformly 15 N- 
labeled CFE polypeptides, 40 mg/L of protonated and uniformly 13 C /15 N-labeled 
isoleucine, valine and leucine amino acids are added to the minimal media. 

NMR Screening 

Compounds in the anti-microbial deck are pre-dissolved to a target concentration of 16 
mM in deuterated DMSO (dimethylsulfoxide) with each deck well containing only one 
compound. Compounds in the small-molecule, pharmacophore deck are pre-dissolved in 
deuterated dmso to a target concentration of 50 mM in groups of 8, i.e., each deck well 
contains 8 unique compounds with each compound at a target concentration of 50 mM. 

3.5 ill of compound is placed at the bottom of a well in .a 96-well, screening plate. This 
well will be referred to as the compound screening well. Each compound screening well 
contains solution from only one deck well. 166.5 yd of buffer is added to each compound 
screening well. 170 \i\ of a CFE polypeptide solution, initially at a concentration ranging 
from 200-300 jiM, is added to each compound screening well; the contents of that well 
are then thoroughly mixed. The control screening well contains only 3.5 \il of deuterated 
dmso. The screening plate is then centrifuged in a bucket rotor for 15 minutes at 3,500 
rpm to insure that all particulate matter is at the bottom of the well. 

The 2D screening method requires a single control screening well in which the compound 
solution consists only of deuterated DMSO. The ID screening method requires a control 
screening well for each compound screening well. In the case of the ID screening 
method, the control screening well is prepared identically to the compound screening 
well except that the 170 pi of a CFE polypeptide solution is replaced by 170 |il of buffer. 

The screening plate is covered with aluminum foil and placed onto a rack of a Gilson 
liquid handler. The Gilson liquid handler, under computer control by the NMR host/data- 
acquisition software, is responsible for removing each sample from the screening plate, 
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injecting the sample into a high-resolution, ! H/ I5 N double-resonance NMR flow-probe, 
removing the sample from the flow-probe, and dispensing it back into the screening plate 
well from which the sample was originally removed. NMR data are collected on the 
sample while the sample resides in the NMR flow-probe. The type of NMR data 
5 collected depends upon whether the 2D or ID screening method is being used. 

Determining Structural Characteristics of a CFE Polypeptide 

In assessing various aspects of the structural state of a CFE polypeptide, NMR was used 
10 to provide the following information. The proton ID spectra and proton-nitrogen 2D 
correlation NMR spectra were used to assess the overall foldedness of a CFE polypeptide 
without actually describing in detail that folded state. Unfolded and substantially 
misfolded proteins produced distinct signatures in these two types of NMR spectra. 

15 The chemical shift of most protein nuclei in either the set {Hn, H a , Hp, C\ Ca, Cp, N} or 
the set {Hn, C\ C a > Cp, N} for perdeuterated (e.g., 2 H-labeled) proteins were determined 
. by procedures well known in the art that involve collecting up to 10 triple-resonance 
NMR data sets. The protein secondary structure was delineated as either helical, turn or 
extended (e.g., p-sheet) by measuring A(5c a - 5cp)> A5C\ and A8Ha where 6 refers to the 

20 chemical-shift value and A refers to the difference between chemical-shift values 
measured in this protein and those measured for the same residue type in a random-coil 
(unstructured), tetrameric peptide. 

This secondary-structure profile was generated in approximately 2-3 weeks per protein. 
25 The secondary-structure profile was used to confirm the functional identity of a protein. 
It was also used to refine the list of possible functional identities of folds, predicted by 
various computational techniques including fold recognition which is associated with a 
protein or polypeptide. 
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NMR was used to generate folds of proteins or polypeptides for which both no structure 
was known of a sequence homologue and no structural homologue was discernible in the 
PDB by fold recognition techniques. 

5 Refining a Structural Model 

Nuclear Overhauser (NOE) data were used to refine both homology-modeled structured 
and previously determined experimental structures of close sequence homologues. This 
process took approximately 2-3 weeks per structure. 

10 

The CFE 88 polypeptide was characterized by NMR analysis to establish its secondary 
structure. The NMR data was used to filter the computer-aided threading analysis. The 
NMR-determined secondary structure for CFE 88 suggested that CFE 88 is structurally 
similar to 4-aminoimidazole carboxylase. 

15 

The characteristics of other CFE polypeptides were analyzed by NMR methods. A 
computer-aided threading analysis revealed that the N-terminal domain of the protein 
EGA, which both binds and hydrolyzes GTP, was both structurally similar and 
sufficiently similar in sequence to CFE 52 to suggest that CFE 52 had a similar function. 

20 

The NMR data of CFE 103 suggests that this polypeptide is unfolded. Circular dichroism 
spectra, as a function of temperature, also indicated that CFE 103 was unfolded. 

The CFEs 2, 42, 43, 68 and 88 polypeptides were tested for their ability to bind potential 
25 inhibitor molecules by screening both the anti-microbial deck and the small-molecule, 
pharmacophore deck. CFE 34 was tested for its ability to bind potential inhibitor 
molecules by screening the anti-microbial deck. 
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NMR-based screening was used to measure binding against both the small-molecule, 
pharmacophore deck and the anti-microbial deck. Binding data from these screens 
allowed assessment of the propensity of a protein to bind small molecules. The binding 
data was also used to identify sites on the protein which are capable of binding small 
molecules. The binding data was also used to identify common pharmacophores among 
the compounds which bind. 

Reverse screening refers to a process whereby known anti-microbial compounds, the 
microbial target of which is unknown, are screened by a general method, e.g., binding as 
assessed by NMR, to find a physical interaction with polypeptide targets previously 
determined to be essential to the bacteria (i.e., the CFEs). The reverse screening method 
was used to determine which CFE polypeptides bind to which compounds in the anti- 
microbial deck. The reverse screening method included the following. The compounds 
in a proprietary compound deck were screened for Minimal Inhibitory Concentration 
(e.g., MIC). The compounds exhibiting antimicrobial activity were designated active 
compounds. The CFE polypeptides were screened to determine which polypeptide bind 
to which active compounds. The CFE polypeptides which bound to the active 
compound(s) were confirmed, where possible, i.e., in cases where an in-vitro assay was 
possible to construct, as being inhibited in their function as a polypeptide by the active 
compound(s) by examination of the inhibition profile of the compound(s) against the 
CFE polypeptides. For additional confirmation, the effect of the compound on the 
microorganism harboring the CFE polypeptide was monitored (e.g., whole cell assays). 
The structure of the active compound was used as a basis to generate chemically-related 
compounds by iterative synthesis. The chemically-related compounds were tested in a 
screening assay for binding with CFE polypeptides. The active compounds and the 
chemically-related compounds of interest were the compounds which exhibited an 
increase in binding affinity for a CFE polypeptide and/or exhibited drug-like properties. 
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The results of the reverse screening are as follows. 127 compounds from the proprietary 
compound deck exhibited anti-microbial activity. 94 of these active compounds were 
selected based upon both lack of cytotoxicity and lack of excessive hydrophobicity. 
These 94 compounds were soluble to 16 mM in deuterated DMSO; these compounds 
were also deemed to be sufficiently soluble in aqueous buffer for both the 2D and ID 
NMR screening methods. 

This subset of 94 compounds was used in an NMR-based screen to determine which 
compound binds to which CFE polypeptide. The CFE 42 polypeptide bound two 
different compounds with Kd's in the range of 0.2 to . 1 mM; the CFE 43 polypeptide 
bound one compound with K<j ~ 30-50 \jM; the CFE 34 polypeptide bound 13 
compounds, one of which inhibited the polypeptide function with IC50 < 10 nM. 

The enzyme assay used to confirm the NMR results which suggested CFE 34 interaction 
with the compounds included the following: 10 pM 14 C-Iabeled malonyl CoA; 20 (iM 
ACP, 30 pM CFE 34; 20 mM Tris-Cl, pH 8.0; 5 mM DTT; in the presence of absence of 
50 |iM of a compound solubilized at 40 mM in 100% DMSO and dilute 100-fold into 
10% DMSO and further diluted 8-fold for a final concentration of 50 in 1.25% 
DMSO. The reaction was performed at room temperature, the reaction was stopped with 
the addition of TFA. Two hundred jal of the reaction was injected onto a Mono Q 5/5 
column. The chromatography conditions included: A) 20 mM Tris-Cl, pH 8.3; B) 20 
mM Tris-Cl, pH 8.3, 1 M NaCl. Hold 10% B for 5 minutes, linear gradient from 10% B 
to 50%B in 10 minutes, back to 10% B in 1 minute, hold for 14 minutes to re-equilibrate. 
The reaction substrate ( I4 C- malonyl CoA) eluted at 9.9 minutes, the reaction product 
( 14 C-malonyl ACP) eluted at 14.3 minutes. 
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What is claimed is: 

1. An isolated nucleic acid molecule encoding a polypeptide which is (1) essential 
for the viability of a bacterial cell and (2) has at least any one of the functions of a 
pantothenate kinase, a Holliday Junction branch migration protein, a single 
stranded DNA binding protein, a phosphoglucosamine mutase, an 
acetyltransferase, an uridylyltransferase, a malonyl CoenzymeArACP transcylase, 
a 3-oxoacyl-ACP synthase II, a * 3-oxoacyl-ACP reductase, a 
phosphomethyipyrimidine (HMP-P) kinase, a GTP binding protein, a ATP 
binding protein, or a 4-aminoimidazole carboxylase. 

2. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:97 or Figure 115 and wherein the polypeptide is a 
pantothenate kinase. 

3. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:35, Figure 60, SEQ ID NO: 19, or Figure 44,and wherein 
the polypeptide is a Holliday Junction branch migration protein. 

4. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:8 or Figure 33 and wherein the polypeptide is a single 
stranded DNA binding protein. 

5. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:3 or Figure 28 and wherein the polypeptide is a 
phosphoglucosamine mutase. 

6. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:82 or Figure 103 and wherein the polypeptide is a 
acetyltransferase. 
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7. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:82 or Figure 103 and wherein the polypeptide is a 
uridylyltransferase. 

8. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:30 or Figure 55 and wherein the polypeptide is a 
malonyl CoenzymeA:ACP transcylase. 

9. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:86 or Figure 107 and wherein the polypeptide is a 3- 
oxoacyl-ACP synthase II. 

10. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:31 or Figure 56 and wherein the polypeptide is a 3- 
oxoacyl-ACP reductase. 

11. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:36 or Figure 61 and wherein the polypeptide is a' 
phosphomethylpyrimidine (HMP-P) kinase. 

12. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:37, Figure 62, SEQ ID NO:48, or Figure 73, and 
wherein the polypeptide is a GTP binding protein. 

13. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:42 or Figure 67 and wherein the polypeptide is a ATP 
binding protein. 
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14. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:84 or Figure 105 and wherein the polypeptide is a 4- 
aminoimidazole carboxylase. 

15. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:48 or Figure 73 and wherein the polypeptide is a GTP 
binding protein. 

16. An isolated nucleic acid molecule encoding a polypeptide which is essential for 
the viability of a bacterial cell, the nucleic acid molecule comprising a sequence 
shown in any one of SEQ ED NOS:l-l 13. 

17. An isolated nucleic acid molecule encoding a polypeptide which is essential for 
the viability of a bacterial cell, the nucleic acid molecule comprising a sequence 
shown in any one of Figures 26-130. 

18. An isolated nucleic acid molecule encoding any one of a polypeptide designated 
CFE 1-1 17 having the amino acid sequence shown in SEQ ID NO:l 14-226. 

19. An isolated nucleic acid molecule comprising a nucleotide sequence which is 
complementary to the nucleotide sequence of claim 1, 16, 17 or 18. 

20. The isolated nucleic acid molecule of claim 1, 16, 17 or 18 which is DNA or 
RNA. 

21 . The isolated nucleic acid molecule of claim 20, which is labeled with a detectable 
marker. 

22. The isolated nucleic acid molecule of claim 21, wherein the detectable marker is 
selected from the group consisting of a radioisotope, a fluorescent compound, a 
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biohiminescent compound, a cherniluminescent compound, a metal chelator and 
an enzyme. 

23. A vector comprising the nucleotide sequence of claim 1, 16, 17, or 18. 

24. A host-vector system comprising the vector of claim 23, in a suitable host cell. 

25. The host- vector system of claim 24, wherein the suitable host cell is selected from 
a group consisting of a yeast cell, a plant cell, and an animal cell. 

26. The host- vector system of claim 24, wherein the suitable host cell is selected from 
a group consisting of an Escherichia cell, a Bacillus cell, a Pseudomonas cell, a 
Streptococcus cell, and a Streptomyces cell. 

27. An isolated polypeptide which is essential for the viability of a bacterial cell 
comprising the amino acid sequence as shown in any one of SEQ. ID NOS: 114- 
226. 

28. An isolated polypeptide which is essential for the viability of a bacterial cell 
encoded by the isolated nucleic acid molecule of claim 1, 16, 17, or 18. 

29. The isolated polypeptide of claim 27 or 28 which is a fusion polypeptide. 

30. A method for producing a polypeptide having the.amino acid sequence of any one 
of SEQ ID NOS: 114-226 or a polypeptide encoded by the polynucleotide 
sequence as shown in any one of Figures 26-130, comprising: 

a) culturing the host-vector system of claim 24 under suitable conditions so as to 
produce the polypeptide; and 

b) recovering the polypeptide so produced. 
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3 1 . A polypeptide produced by the method of claim 30. 

32. A ligand which binds the polypeptide of claim 27 or 28. 

33. The ligand of claim 32 which is an antibody or an immunologically active 
fragment thereof. 

34. The ligand of claim 33, wherein the antibody is a monoclonal antibody. 

35. The ligand of claim 32 which is a diazalactone. 

36. The ligand of claim 35, wherein the diazalactone comprises the structure: 

P 

N N0 2 - 
N r=( 

CF 3 

37. The ligand of claim 32 which is a JV-pr.otected amino acid. 

38. The ligand of claim 37, wherein the JV-protected amino acid comprises the 
structure: 





39. The ligand of claim 32 which is an azabicyclodiene. 
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40. The ligand of claim 39, wherein the azabicyclodiene comprises the structure: 




41 . The ligand of claim 32 which is an alkaloid. 



42. The ligand of claim 41, wherein the alkaloid comprises the structure: 




43. The ligand of claim 41, wherein the alkaloid comprises the structure: 




N. N 
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44. The ligand of claim 41, wherein the alkaloid comprises the structure: 




45. The ligand of claim 41, wherein the alkaloid comprises the structure: 
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46. The ligand of claim 41, wherein the alkaloid comprises the structure: 




47. A method for detecting the presence of the polypeptide of claim 27 or 28 in a 
sample, comprising contacting the sample with a ligand which binds the 
polypeptide and detecting the binding of the polypeptide with the ligand in the 
sample. 

48. The method of claim 47, wherein the detecting comprises: 

a) contacting the sample with the ligand; and 

b) determining whether a polypeptide-ligand complex is so formed. 

49. The method of claim 47, wherein the sample is a cell, a tissue, or a biological 
fluid. 



50. The method of claim 47, wherein the sample is blood, serum, a swab from nose, a 
swab from ear, or a swab from throat. 



51. The method of claim 47, wherein the ligand is a diazalactone. 
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52. The method of claim 51, wherein the diazalactone comprises the structure: 




53. The method of claim 47, wherein the ligand is a W-protected amino acid. 

54. The method of claim 53, wherein the Af-protected amino acid comprises the 
structure: 




NH 




o- 



55. The method of claim 47, wherein the ligand is an azabicyclodiene. 

56. The method of claim 55, wherein the azabicyclodiene comprises the structure: 




OH 




OH O 



57. The ligand of claim 47 which is an alkaloid. 
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58. The ligand of claim 57, wherein the alkaloid comprises the structure: 




59. The ligand of claim 57, wherein the alkaloid comprises the structure: 




60. The ligand of claim 57, wherein the alkaloid comprises the structure: 
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61. The ligand of claim 57, wherein the alkaloid comprises the structure: 




62. The ligand of claim 57, wherein the alkaloid comprises the structure: 




63. A method for detecting the presence of a target nucleic acid molecule as shown in 
any one of SEQ ID NOS:l-113 in a sample, comprising contacting the sample 
with the complementary nucleic acid molecule of claim 19 and detecting the 
binding of the target nucleic acid molecule with the complementary nucleic acid 
molecule in the sample. 
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64. The method of claim 63, wherein the detecting comprises: 

a) contacting the sample with the complementary nucleic acid molecule; and 

b) determining whether a complex comprising the target nucleic acid molecule 
and the complementary nucleic acid molecule is so formed. 

65. The method of claim 63, wherein the sample is a cell, a tissue, or a biological 
fluid. 

66. The method of claim 63, wherein the sample is blood, serum, a swab from nose, a 
swab from ear, or a swab from throat. 

67. A pharmaceutical composition comprising the nucleic acid molecule of claim 1, 
16, 17, or 18. 

68. A pharmaceutical composition comprising the polypeptide of claim 27 or 28. 

69. A pharmaceutical composition comprising the Iigand of claim 32. 

70. A method for determining whether a genomic nucleotide sequence of interest is 
essential for viability of a bacterial cell, comprising 

a. integrating an exogenous nucleotide sequence into the genomic nucleotide 
sequence of interest, wherein the exogenous nucleotide sequence 
comprises a portion of an open reading frame of the genomic nucleotide 
sequence of interest, and 

b. determining whether the cell having the genomic nucleotide sequence of 
interest so integrated is viable. 

71 . The method of claim 70, wherein the portion of the open reading frame comprises 
about 200 to 500 base pairs in length. 
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72. The method of claim 70, wherein the exogenous nucleotide sequence further 
comprises a nucleotide sequence conferring a selectable phenotype to the cell 
having the genome so integrated. 

73. The method of claim 70, wherein determining comprises selecting the cell having 
the genome so integrated in the presence of a selection agent. 

74. The method of claim 73, wherein the selection agent is chloramphenicol. 

75. A nucleotide sequence of interest which is essential for viability of a bacterial cell 
isolated by the method of claim 70. 

76. A bacterial cell comprising an exogenous nucleotide sequence integrated into the 
genomic nucleotide sequence of interest, generated by the method of claim 70. 

77. A method for determining whether a genomic nucleotide sequence of interest 
resides within an operon, comprising 

a) integrating an exogenous nucleotide sequence into the genomic nucleotide 
sequence of interest; and 

b) determining whether the cell having the genomic nucleotide sequence of 
interest so integrated is viable, and wherein the exogenous nucleotide 
sequence lacks an expression regulatory sequence. 

78. The method of claim 77, wherein the exogenous nucleotide sequence further 
comprises a nucleotide sequence conferring a selectable phenotype to the cell 
having the genome so integrated. 

79. The method of claim 77, wherein determining comprises selecting the cell having 
the genome so integrated in the presence of a selection agent. 
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80. The method of claim 79, wherein the selection agent is chloramphenicol. 

81. A method for inhibiting a function of a CEG polypeptide which is essential for 
viability of a bacterial cell, the method comprising contacting the CEG polypeptide 
with the ligand of claim 32 under suitable conditions thereby inhibiting the function 
of the CEG polypeptide. 

82. The method of claim 81, wherein the function of the CEG polypeptide is selected 
from a group consisting of a pantothenate kinase, a Holliday Junction branch 
migration protein, a single stranded DNA binding protein, a phosphoglucosamine 
mutase, an acetyltransferase, an uridylyltransferase, a malonyl CoenzymeArACP 
transcylase, a 3-oxoacyl-ACP synthase H, a 3-oxoacyl-ACP reductase, a 
phosphomethylpyrimidine (HMP-P) kinase, a GTP binding protein, a ATP 
binding protein, or a 4-aminoimidazole carboxylase, 

83. The method of claim 81, wherein the CEG polypeptide is selected from a group 
consisting of CFE1 -113. 

84. The method of claim 81, wherein the CEG polypeptide is 2CFE 34 shown in 
Figure 55. 

85. The method of claim 81, wherein the CEG polypeptide is 2CFE 43 shown in 
Figure 64. 
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86. The method of claim 81, wherein the CEG polypeptide is 2CFE 34 shown in 
Figure 55 and the ligand is: 




87. The method of claim 81, wherein the CEG polypeptide is 2CFE 43 shown in 
Figure 64 and the ligand is: 




88. The method of claim 81, wherein the CEG polypeptide is 2CFE 43 shown in 
Figure 64 and the ligand is: 
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89. A method for identifying a ligand in a sample which specifically binds a CEG 
polypeptide, the method comprising: 

a) contacting the CEG polypeptide with the sample under suitable conditions 
so that a complex having the CEG polypeptide and the ligand is formed'; 

b) recovering the complex so formed ; and 

c) separating the CEG polypeptide from the ligand in the complex and 
identifying the ligand so separated. 

90. The method of claim 89, wherein the sample is a tissue or biological fluid. 

91 . The method of claim 89, wherein the ligand is an azabicyclodiene. 

92. The method of claim 91, wherein the azabicyclodiene comprises the structure: 



93. The method of claim 89, wherein the ligand is a diazalactone. 

94. The method of claim 93, wherein the diazalactone comprises the structure: 



95. The method of claim 89, wherein the ligand is a ^-protected amino acid. 




OH 0 




NO2- 
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96. The method of claim 95, wherein the ^/-protected amino acid comprises the 
structure: 




97. The method of claim 89, wherein the ligand is an alkoloid. 

98. The ligand of claim 97, wherein the alkaloid comprises the structure: 




99. The ligand of claim 97, wherein the alkaloid comprises the structure: 
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1 00. The ligand of claim 97, wherein the alkaloid comprises the structure: 




Gl 



101. The ligand of claim 97, wherein the alkaloid comprises the structure: 



N 



jO 
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102. The ligand of claim 97, wherein the alkaloid comprises the structure: 



CI 
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Gene Disruption Assay 
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1 OP Elf 'homologue of SEQ> ID NO. 1 H i 
AT 3ATT|rAKH3AGGMTItnTCCCG^ 
TT ITAGj LOCTAGOTOATCOACCTATTtroAT7XZATACAA 
AA MIJSTAGTTGOTGTTCATW^ 

TC ITAA K5AACGTATCATCATTACAAAGGGTGGTGCTG ACCGGAATACAAGTATTAAGAACATCATTGAA 
GC ZATf iATGCTTATCGTCCGCTTACTCCAGAGGATAK^TTC 

ac \ctt:ck^tgattcaggacaatattx^ 

GO XrrTUTACTATCGTTC^ 
TC iAGQ\CAAACACCK^AACATKX:GTra 
Q \ 3AACGAAATCHTOACAGATGCATO^^ 
GP rdAA rACXX^VAATCTGAAGATTACAACCGTAACAGATTTG 
GApTAG 

2CFto'"homologue of SEQ. ID NO. .2": 
Afl JGCT^CGTAATTATO 



C L iQ W TCATtGAAAG, 



fttAGGl 



l > Tqq 

^ CA< 



AA(A< 
GCIA' 




!TCTTAACCAAAT(XCTTCAXTTACGATTCC 
" ' " ' >CATraAOCGTGCXnTGAACGCTTCT^^ 



G rCGGCGAAAATGCTAAAGTtKXTOTC^ 
aai :aag ^AAAAGCAAAAGAAATCACTOAAGA^ 

CTTTTGGAAOTCTAA . 

'^Cfl9 "iomologue fjf SEQ. ID- NO. 3". 
AT? OG*[AATATITra<K3ACTO 

.TT/ AACl AGCACQTTTK3(Ud<^ATOTICrTAGTCAAGATG 



CC< GTG* taTOATOCTtCTAGC^^ 



TOM AOTCCAOAAiaGCm<KJAATrrroaTAQ^ 

en 3Trrt :aactggmcioctctt<u^ 
""'-fccc cGTt^TcrrTCCAcACcnGG^^ 

Lt£/ACCTrAATdTTGGTKaACAC^ 
TTG GTTTOGCCTTTCATCGACL^CAGTOACCGCrTGATrGCTG 
ATflQTG/ GAAQATiATOTACATWX^^ 



^^rTtnTOATGGATTACAATACCACAGOTOATGGTCAA^ 



TCACGTTA' 



hjo^mpiogue of SEQ. ID m\ 4" 



28/60 



WO 01/49721 



PCT/USOO/35604 



AjrcuAkMAjmcTAATre^^ 

A. iqGT 'ArGAAGTTGTMCTGCTTTTMTGGTCGTGAAGCQCT^ 
TV JTAjl TATTCTGGATTTGATGCTTCCAGAAATTGATGGTTTAGAAGTTGCTAAGACCAT^ 
^ 0< ^GTi iTGCCCATTCTTATGCTTTCAG CXIA AAGATAGTG AATTTGATAAGGn , ATCGQTTTGG AaCTTGGG 
^ G< :\GA [<lACTATGTAACGAAACCCTTCTCCAArcGTGAGTTGCAGGCGCGTGTTAAAGCTC^ 
• 6; TCXt AACCTATCCCUGTAQATGGIt^ 
VF^ A<?AAA nrGTTCCAGACGCCTACGTGCCTAAAAAATArc^ 
^ to .GOT TTGT&TCWriTAG<*CC«^^ 

G< fGO? ATG4jCTATTTTGGTGATGTCCGCACAGTTOATGTGACTGT^ 
AC >ATA< IGCCGkGCCGACCAOAGf ATATCTTGACGCGCCGTCW 



TCA 



2 QIS "homplogue of SEQ. ID NO, 5" 
Al GGMGAAAlPlxnrCTGTATTGGtroTOT 

CC CCCC/VGTTpGCACriXjAAAAAGGrniGGAGACTGGCXUAGTCT^ 
CC ACTA CAATGAMTCACAGATOTCCAGTTGACGGACXjATGATTTC 
QA CAG1 GATGCTnAGTGGTCAATGTCATTGATA TCTT^ 
AC GTTT CGTCJTCXjGGCWATOATQTCCTCnTGGTAGGAAATAAAAAAGATAT 
DC TOOT ^G,mAC^aT<KKTCATGAAACra^ 
TAA<Tr^GCAGAAAATAAAtATCCC^^ 
GCGATpiCTMGTCGTC^ 



o 



5!&A^ccrATTTm^ 

IT XJTt 'ATAATGMCTCAAACTCCATCGTAGCAAGCT^ 

TG jGAA ctc^PTGacaccaccaaatagcaAckiaaaaagaa 

2CF 5« 'Jiiomologue Of SEQ. ID NO. 6" ■< •' .' 

A I ! ^ItP 0 *^** 01 ^*^^ 
AO^g^TTCAC^TAAGMCKKXKSrariTCU 
TG< GG* TnGCACKmTGCSAAAGAATT^ 

or car CTctoCCAA(iAACKSGcurrTCGTnTTt^ 

AQ< 'ATOWTKWGTTATCGAAqATCUACXX 
GA*Cq<iCCATCTC(KK>AAC^ 
OG AACf nPCTmA<^lTCCTOT™ 
TO/AGC/TK^OOOTACGC^Aa^OV^ 

GAflGA'tm^CiCArATTOCCT^ 

^ATGCTTCTMTQATTTCklACGAAAATACTATCCT 
CXSrCGGTACXaAqC^^ 



GA< IOCIVaOA 
AG) TTG/ TG; 




AC^TCW^CITC^TTCATMGATOCGCM^ 

iCI^M'hoinologue of SEQ, "ID NO. "7" 
TTOjnraAj^TGATOATTTrAflTAGA^ 



29/60 



WO 01/49721 



PCT/US00/35604 



V 



V AT TOTT SGTOATAMCCAAAACTCTTOATTlnrAAACAAGGCOT AOO 

S A/ TGG( QTUttfrATTTTGAATCAC^GG^ 

V) U aAQ- TOTAAaGATGCGOX^aAAGCTCATCK^^ 

^ TCAGAini^C^GGTAC^TGATTATCGGGATOCAAA 

«" nSGClSGTAAAMGATTCKnX/TroTTGGAAACMG^ 

,-*\ AC AAAQATCTCGAAATCTTGGATACACCGGG ^TtCTCTGCCCT/UiQTnQAGGXlhj^^CT^m 

\i ^TT^GTTOGWTTGACTOGAGCT^^ 



<WAGa CCTCTGATTATTATAGATATOACCCXHrCKXXnrG 
TCrpTTXXn^GGAAGratiTGATC 

2Cpl£8 "homologue of SEQ. ID NO. 8" 

IACA 



^"homoibgrue ofi SEQ. ID NO, 9' 

. .^JATG(rrpATt™CA(3GGGCTOTTra^G^ 
Q^^AGCTPK^CGO^TATTCtt^^ 
AT<KTOTATC71CmTTGTTO^ 

A<J JAG(gtGaAAATCCAAGCAAQXtTATGGAA(XJtn^ 
^WilCG^mAGCTAAACGCATCKSAAAAAATC^ 

XJGG XKXATATOSGXAAATrAACAACCAT^CCTTCMW 

2^ ACA< KSTOGXJGACACGaTTTOTAOTTGCAAAAGAGT^ 

. A^^ACTAdAOATTrTGAACTCGCTC^UAA^ 

A ~ ^^TCAAATCOlWGGCTTGTTTtrAA^ 
A<?f GAG XttCTMGAAAATTC^GAAGAACtfCTC^ 



CO 



"homplogue of SEQ. jp NO- 10* 



>3 
to 



J- 



GTIGAA GAAAttKTWAA^ 



£CEElXjirhomol6gue of. SEQ. ID HO. JJ" '. 



GGC AAA/ .CGjrGATGCCAAGATTTTGAA' 



I^7XUTim^TCA(iiCKJCt 
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NO 

cn 



qAto4taTATM^^ 

TG Grf AC TGGACCAGCTATTAATGACTTGATTCGCTACATTC^ 
OT fAGC GAATATAAAGTCAAICTCAA^ 

CT :GTT rTGGTCGCATO AmTGAAA ATGCTATCC AMTGATGGAAGT 

AA AAAfl ATCtitTrATCTAGAAGAACXTTCT 
^ TA KACA C^GCCTTTAAAACAGTtnTTAAGTGGGAAGTATAA . 



AA/i 
AGO 



Do 



2 CFfell "hoinpiogue of SEQ. ID NO. 12" _ - 

AT SAACtrtAAkAACTACTTTGGGC^ . 
TC 3AAC TACGCTOC^^ 

TA :GAfi ATTCTCGTTCTCACTGGAACAAATGG . 
M 3 AGC TTTAiTGGltAAGTTCT^ 
CC rA AC iGCC^AATCTTCAAAAACtGGGA^ 
CG IXTC rGTGACTATATCCAGCXTrAGTtrrntrG 
OD rOAA ATCTkTAffTAGCTATAACAT^^ 
T\. \CGG \GAdAGTCCACTTTTCTAGAAGCC^ 
AA MAC GACdAGCOCAACTGGCTCACTACAATACCGAAGGGATiCT 
CA \aTA rGAGC^TAATACCTATGGAAACrr<3G<n^ 

[CTO iACTATMTTTGACAAAACTGGTTO 
AA 3A!\i VCGOTATC(7UVATtX3GCXKi^^ 
TT3 €fHV GGTCCXXJATnXKJAACTCATCAAA 
GA \ACC rTT^TATCGGTGACAAGGAATGTACOT 
CTi <TC& iAATOATCaaACTAGC^^ 
GG. VAITt JACACTAGCTOGATCTOGGATGCAGACTT^ 
AC< }dtai JCGGTXHTCGTCATTCT^ 
C^< TGAj lAOGAGrAATtnTiGAGCAAGTlXnCAAGAC 
CTC IGPAj ^CTTArACTCCCXTGCTGGAATTTTC 
TGAACTJlA 

ACJrtftf "homologue of \SEQ(. ID {JO. 13" 
ATC rGTTiATACTTCACTTTCCTCAAAAGATGG CAATTACCCCT CTCAACATTGCCCACCTCTACGG 
iTteW ATOiAATACCTACGiKWACAATGGAAA^ 



)P{ GTGACCGTTGACATCGTITCTCTCCATGATGACTTTGATGAAAATC 
TCdGXG< fTGGTCAAGACTtTGAACAAAGTATGATTCCAGACG^ 
A& ACT> .CATCCAAAACGACGGtGTAGTTCTGGCTATCIXj 

GAAXkJrrCAGGAAAACGTATtoAAGGGCrAGGGGTCATC . 
( :CGt3 TTAlfcGGrGAGATCAAGATICACAA 
ACC AAG< CCGTACCTT^TCTCTGATCACCAAAAACra 
CQ/ AQAs lAAGGTCGGTGAAGGGGTTCATTATAAGAA 
TCI CGTA WGCfcAAIX^GCmiCGC^ 

AAC TCCC TGCCTATGAGGATATCCTCAGCCAAGAAATCGCTGAAGAGTACAG^ 

CTCAcnnrcrtAA 

2CFM4 ; ' , ~- ... 

AT)C AATOTAAAAOAAAATAGADAAOTGTIT^ 
AQA QTGC rrcc^liCTCTCTCArrGC^ 
GCT M3G1 GTTCATCATaTCGGTGAAAATCGTO^ 
COA GATC tGACTTGGCATTIGATTGGTACCTTO 

Am< ^ ATC^XTGG ACTCAGTA AAGCTAGCAGt^ 
AGTlSTITPCTTGi^ 

iGTTAGCCAGACrAGATMGATTGAATATGTTGGTrTAATGACGA^ 
GCjKGCAlGTOA^AGITGA 
CAA VTTO XAATATGOCTATGACCGAGTTJOlGTATGGGAAfGAGTC 
TCG 3TTO JACTtTTOTKX^ 
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•0 



CA STA- 
CK rdA< 
ct<3Ga< 



TG/ 
ATC 



iciEiS "homologue of SEQ- ID NO. 15!' 

a' £3Gg1lattggtctagaaaatc 

& i ATG TrCTTTGACGATTCMGATGGCTCTrATA^ 
A< TAT1 TtACAACTCTTAAATGGTrTATTGGTGC^ 
O fctfri GACTJCTAAAAATAMGAfATTCGTCAAATTAGAA AAC^ 
G> AiAA rCAGATTTTTGAAGAAACGGTTTTGAAGGACGTTGCTTTTO 
AJ jATGCTGTGAAGACTGCGCGrcAGAAACTGGCTCTGGTTGGAATTOA 
W GTQ QTTTpAGCTGTGAGGGGGACAAATGAGACGT<^GCCATTC 
TC TATA TTAGTCTTAGATCAGC^AACAGCTGGTCTAGATCCrcTA 
TC AAA) AACTCCACCAGTCAGCKjATGACCATGGTCTTGGTAAC^ 
TC CTA/ TCAAGTCTATGTAATGGAAAAGGGACGTrT^ 
AC ACG f TGTTlTTATGGAAaAAGrn3AGTTGGC^GTA 
Qi TAGN GGCOTGTCATTTAMCmm 

2CJEA«> ■homologue. of SEQ. ID. NO. 16" 
AIQdAiATTC^ 
TG /MtAC TCITCGATGAGATTA^ 

GC AXA^CAC/^mGACCACGTAAGGTCAGCAAAATCm 
G ritC :CAGQTTTCCmCTAGCCGTGCCTTrCAGGCC^ 
OA CqTC AAGQAATCAAGTCATTTCTXnTAACCAGCr TT^ 
CA rfTC ^TGAGTITGAMAAGATTCTCrrGGGTAAAATTC 
GC AGClGGAGCACACTATTTTXTreTG^ 
GC YOM AAACTCMGGCTGCtGGTGTTCCGTra 

}TnT OGAAGACGGAACTGAAATCAAGGCAGCAGACTATAT^^ 
:TATrTTAGGAGACACTCGAAAMCGGGTGCCAOTGTGCGTCTGGCTGTCAATC 

rgag rccacttatggcaagggtgatgaaaaaattgctcgtaaccatc 
g^gtagcggtagaagcaggtgccaaacgcctcctactcaacx^ 

AA (Ufa/ TaTTAGCAAACTC^GAAAOACGCTGCCA 
OA &GAA GTGG AAATCTAG 



eCFkll . "homplogue of SEQ. ID NO. .17" . 
AF iAGT|^TA^CA6TTTAAC^CAClTtK5TC 
TTC rGAG. ^GT€^TTTTTGftTTGAATGTAGGGTTA 
GG' XjAT rCCAMCATCGATTACCTrTTTGAA AA^ 
AK SCflG iTGCC^TTGGTGCTGTAOCmTC 
AC< ^Tft fAGTteGCAAAGCTCTTTOTCAAAG 
TTC ATGV .GAATACGGAGATTGATTTTGGTGGGACAGTGGTTrc 
GA< JAGT 'TGGGAATTGTCirGAAGACATCGGAAGGAAGC^ 
A^C^prAGTGMTCTTATGCAACTGATTTTGCTC 

:ggccaatgcAqacagcaatattCaggtggctag 

ATtfA^dAAACTATTGCTOACTXto^ 

GCA< ATTTTTGACGCTGCGGATXAAACAGOTCGAOT 
dGTCC GCAQAGCGATTCGTCTTAAGAAGTTGTCTTTAGCC^ 
AT4TOTC TCGdlTTGAAGACCATG AGTTGAm 

01 AAGATOTCGATTGGTCGCCATCGTM^ 
GGTC CGTCTAtTGCTAMQAAGCCTTTO™ 
TTGjTQAfflTTGkTTACCCA^ 

CCTAAGTACCTCTTCCCTGTCrAAGGGGAGTATC^ 
GnGGGATGTTGCCAGAAGGGATCTfXZATrcCTAAAAAGGGGA 
A£^T^GACTTTCTTC£A 

tg a rorr sgamixhtottci^ 

CAE rCAA CCGTjpGTOAGAAGAAAATTGTGGCTAGAGCTCGTGTTCAC^ 
GA^ GAG! CGO^TATTCntXXSTGAAAGTTCAG AATTG 
AGA TGAC ITrGACTGiGCk^AGATCrrCAAAGGTAAGGTrCGTG 
ACqubfcGTCG|fcc^G 



<ft TGC 



32/60 



WO 01/49721 



PCT/USOO/35604 



l (Cf my "hpraqlqgugi._gf„SSJL. , ID . JIQ_. 1 8," 
ATOAC/!AAAd>ATTTCAfCATGTAACGGT^ 
AA GOT/ ,TCTA<XrTTGATtKX?A<HTTGGGCGaAGCAGQACATAG 
AA|AAG<JCCATCTCTaTGCCTTTGACCAGG 

rGGTGACCTTTATCAAGGATAACTrcCGTCATTrAC^GG^ 
CT{K3T$^CAGGAAATTOATGGAATTTGT^ 

mTPCfrATAAAMGGATCCGCCACTCGA<MTGCGGATGM 
TAtrGA^TGG^AATCATTATTjACTATCATGATTTGQTTC 



C* HAA \(^GAtTGCGCGTMGATTOAGCAAGCCCGT^ AAGTGAAGCa5ATTGAG ACAAC^CTGAOtT 
AG CAG/ GATTATCAAGTTOGTCAAAGCTGCCAAGGAACTCAAGAAGAAGGGTCATCCrGC^ 
TT^C^pGCTATTCGAATTGAAGTCiMiTGATGAACTGC^ 

3GATGGTAGAATlTCAGTGArrAt^CTrrC^TTCXJrTACSAAGACCGCT^ 
ATtTGT^tAAGGAAGCTrcAACAGTTGAAGTTCCAAAAQGCTTG 
M 3ATC GMTflGGlXnGCCGTAAGCCAAltnlGCK 
CApTCUpCCAAGrrGCGCGtGGTGAaAAAAATTX^CAAGCTTO^ 




ATG 
tga 



£CF22I "homologue of SEQ. ID* NO. W" \ . ■ 
Aff }AOf KGAATTrTAGATAArGAGATAATGCKH3GATGA^ 
TA' TTA< CTCHATATATCGGACAGGATMGGTG^^ 
TO :GGdATOAfVGCCCT<K5AT^ 
TG' TAT GCC^CaAACTGGGAGTCAATC^ 
GT< JGTA< iCTATTTTOAATGAGTTAOAGCCTGCGGAT^^ 
Gf < IAGT< KSAAbAGQTGCnfTATAGTGC^^ 
Off CAGT :GTA£fGTTCATTTGGAG^^^ 
eiXfCAA' XXG<?TACX!GGCACGTrTIt KK3ATr ACAOGCCA 

VTK iTSGAG<X5GACGGC^GATArrmGAGATGGAAATCA 
' tflTCGTGCtoACCCCTC^ 
T 3GGG[TAATl^TGATATrATTACCGATAAGGCTTTGA 
T0$ACTV LTCTGOATCAAAAAATCXiTrra 
- Mrrmitrrc^AATATCGC^^ 



j kG< 



OT TrATCATOCGGACACGGTCnrtJGACGGGTC 

xc^tgaaaaagcc&ccgcactcg^ 

2CB24 "homologue of SEQ. ID NO. 2D" 1 . 

ATCAGfttTdrrm^ 

TTC OTCGTGAXAAATATGTdCCTAATGGAGGCCCTTG 
CTI CGTT STAGACGAAGCiACTACGTACCTrGATGGATnXX: 
QIC AAA> AGGQATGACCAAAXKjGATGGATCGTCGTGOT^ 

GtA CQAJC TGTljpGTGATGCGGAGACTGGCM^ . 
CGI TGCC CACQOTGGTCGIXSGTGGACGTGGAAATATIX^TTK^ 
Alt TCTG &AAATGGAGAACCA<K3TCAGGAACGTGAGTTH^ 
<KH TTAA rAGGATTCCCATCTGTAGGGAAGTCAACACT^ 



TGG*AGAA< 
CGA3ATGGGT- 
GTCATGAVA' 



jGGA 



CAC TAGC OGACTIXKX^GGlTroATTGAAGGGGCW 
m ~'~TC3AGCGTA(*CGTGTr^^ 

"TA CCTAGCTATCAATAAAGAGCTCGAG 
TGT UVCT UtAAGATGGACATGCCTOAG^ 
AAATTAT UTGAAmGAAGAarTAO^CKTATCrnnrc 
ACApnTrAGATGCTACAGCTOAATTOXtA 

* ~ * * - ACTATGGATTTGACGAAGAAGAAAAAGCXnTTC 
^TGGTGAAAMCTCATGAAACICIT^^ 
OSltAGCTTCGTGGTATC^ 

~ cakJAtrGcrrAA^ 
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^i.ji!bonu2if>.gye. pf.SEQ. id ..NO- -2,1" • 
AT&AAC tact^mtgttgqgaaaatcgttaatacgcagggattacagggtgagatgcgaOtcttgtcto 

TCjlCGCATTTTrGCAGAAOAACGQTTTAAAAAAGOAGCTOAOCroG 
TC*qCAU(*eTGACCATCGC^^^ 
TAeCAtVTCA^TACTATCGAAAAGTAC^^ 
i"^ CTAGA? SATGI^miAATTTTACnVVTCACOAGAmTteGTTlPGGMG^^ 
<^MR TCAAOGAAATCCTGCAACCAGGTGCTAATGATGTCTGGGTGGT^ 

ATlradcnrnAprrATATCcciccAcnGGTTCTCAATGT^ 

A iMGffiGTrAGACGATGAAGATCTC^^ 

'homol ogue of SEQ.ID NO. 22" . 

AdATTCATATTTnrAAOXrrCTTO 

wAGA \AAA"GGGCTCTTCGATAltX!AGTATCATAATTTrCGAGAAAAro 

OApOAT 5AGC]IXTAGGGAGGCGGTCAGCM^TGTKXntAGAGTACAACCTATT^ 
CTATTG,UAAaAAAAATa:GCGCGTTAT^^ 

^^AnTC^KnCAAGAGGAAGAGCrAATCmAlXnG 

A A\^CCrTGGTAACAGATGACUTITCCCTAGGCGACTATGTTCT^ 

. ATteAt QATGAaTGOTCACCATGAAAAGATTCGra^ 
CGAGCG ^GGCCGGATT7A£nTGAACATTATCAACTGaCAGTAG^ 
C^GA^CAMGMG«K^^ 

2Cf , ip7 "ndmo'logue of SEQ. ID NO. 23" 
ATAA^AAdCAAGTAAATTAAAAGCTaGTATGACCTTTO 
TCjAAOn-AGTCACCACAAACCUGGTAAAGGAAACACGAT^TCCGTATG^ 
^ CTdGTO TACATTTGACACAAGCrACCGTOCAGAGGAAAAATTTGAAGA/^GTA 

fcCTt TAGTPUTGTTGAAMOGAATTGCtrTACATOOT 
fc£* «T^AC^A<*CraTIXn«aTAAAC^ 



^Jra28( M hpmologue of SEQ. ID WO. 24" 

CTC^CCTMOATiaUACX*^^ 
WtGOX^CAAACTCAAGAMGMGAA^ 
^GCI^TTOAOCAGCITAAGAC^GGACAA^ 

^AtACTOOKHnXXJ^CA^ 
: I^CCAAATCAAATCITCaTCT^^ 



ATTJ 



MDC^TATfGCrGCTGOTTCroOAAATA^ 

GC CTAAAC^GCTCATCX^CKJGTCmiGTtlXXkJSATAT^ 
1TC(IAaATaACC7TOCTA AAAATATrtft?'* a ATArnnnln^\i^^^^^i^!^^^^^^^^^ 
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TOATG 
1X> 



(MAAA' 



.ft 



2 cr£ * 



GaCAAGGCGGTATGCCTGACTTATCAGCTCTC 
jTGGCGGTTTGAAAGGTAaaATTGGTGAATTO 
rGMpAAAGCGAAGAAGAAACGCAAGGCGGCCGCACT^^ 

''homqlogue of SEQ. J® NO. 25" 



G^GCC 

ice 

Cl TT 



GT/l' tnTATTGAAATmAAAATCTATeTTCTTCGGGATTGTO 
rtCCAt ITAC^GGT^CXTCUTrrrAGCAGAGGAGm 
T< CAT* fTTT/^TGTCGTGATTCAGCTTGG 

c( mm aaag€gactmggacaaac^^ 

C( ACip TGCCaTrACTTCGTGTClTrAAATO 
f] GCTC TCATpTrGATTATfoACGGGGTTGCCTTC^ 
" TCC LAGTGTAACAGtAGTTGGAGAAGCTTCCTTATACGACCGCT 
TO TTTACCAGGGACTAGCCGTTCAGGTGCAACGATTC 
TH TGAGAGAATTTACCTTCTATCTTG^ 
TI3 GTGAAAGCCGGAGAACTCTTGAGCTTTGGGCAATTGTTO 
CI nfttji XGIClGCATGGTGGCTATTCXKnTCTTGAOCAGCrATO 
<K rc*AU TaCGGTATCGTGCTTXKJTAGTGTTTTCCTACTTTACAG 
C/CCACCACCACGACTGA 



2cipE30< "homologue of SEQ.ID NO, 26" * 

SdGWTTATTTOACCGTCTATTCGGAAAAAAAGMGAAOT 
SGA^AATCTTGATTTGTCTGAAGATGTTGATC 
^GCA< AGG^GAAATTOTTGAACAAGCTOTGrrCCM 
G7 nt}GA.TTTAGAGCGAGTTGTAGAAGTTTCTGAAi^^ 
GAATACrC^GtrTCTAGAGAC^^ 
AG AAG/ AACXJOTTCAGGAAAAATATGACCGCAGTOT 
OA AJGC CTITCrrTOCtAAGTlC(XkrrCTC 
TC ^TG^TGTIGGTOXXrAAGlCGCTTCT 
TOCGAAbAAACCTOAXt^ 
h AG^A<^ATGAAAGCATCCACTIXX^GATA^ 
<K 2f ^ C ^^rATCGGAAMCTA(X^ 
^ CX< jCAffimcbTTCGGTGCGGGTGCAGTAGC^ 
AGjrAAetrGGAbcTGAAAAAGCTGATCCAGCC^ 



-T#A< 



:atv 



r4 



iGA 



ijteoTAmraAAcott^ 

IMAATXKXXHAGTACAGGCCAAAGA^^ 



utggj 

^ 'GTC^AAATCCCCTAGTAGAGGCCiMAOAATTrrc 
TO+C^GATraATGOAAC^^ 
AAUT^lATTGOTTmKnXJAAAAAATCmTOATATTXra 
CT^TTCt AAGGTTTAATCGCGGCCGCACTCGA^ 

31 • homblogue of SEQ. ip NO. 27" 
'TATtoGAAATGGTAOATGAA^ 
«C ^ TCK^GCCCAAAAAmbOAAAAGAAGACAAGOAaATGG^ 



GAOTTT 

2CBEB, ' homolog'ue' of SEQ,.>ID NO. 28" 7 
^ TOCpACffAAA^TACACGTTnTKnxnTGAAAATGCX^ 



'ATS 
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« QM< SAAAttGCAGAGCTAMAGAATTTTCTCGT^^ 
TC CCTI GGGTGCTGTCTTACTCATGCAATTTC 
Vj tl GAG( TCCAJ^ATXUCMGAA^^ 

CAGAG<STTCGAGAGTXX^ 
^ ClGACirimTGGTATGCGCM.TCCACTCAGITC 

G( CAA' t^GOGAGAAGGGTTGACCCTCCGTCATGACTCCTATGATCGCATCTCC^ 
XT TTGC GAATjTAAAGAAGTTCTCAAGCGTCATXjAGCTTGT^ 
GC ACW C WCpCCACTXjA 

£CT£to< "homologue of SEQ^ LD, NO^-29 n 

v. Al ggc^aacXaacaagatttcatcgctaaagtagcagaag ^ 

y) GC AGC/ GTTGAAGCTGTATTTGCAGCAGTA<*CnX3AGTA X} 
Gl TCTG jTAACTTTOAAGTTCGT^ a t 

\< OA CAA1 tocagcttctaaagtaogagcattc^aagciggta Lfl 

CjACCACCACCSACCACCACTGA ■ 

. 2 OF 834 "homologub/of SEQ. ID' NO. 30" 

AT 3ACT MAACAGCCTTTTTATTTGCTG^^ q 

AT 2AQ1 VPCCGATTGTT AAAGAAACGATTCA^ T 

q\ FCGA rACGGAAGAAGACAAACTCAATCAGACCCGCTATACGCAACCA fT 

CCrATC rACCQTTTATTGCAAGAAAAGGGCTATCAGCCTGATATGG TA 
CP 7TGC nTGCfrTGGCAAGCGGCGCCTTGGAlTTrG^ 

x TA'rATC}AAG^GCCKOTC^^ 

^ Tt< ?AAG ^GCpTCTCAAAA AGGTTCTGAAC1T ^ 

M CCGT ^TTGCTGGAGAAGT^TTGCAGITGATt^ 

r <X CGi Jim ^riXTQTCTTAAGGT GTCAGG TCCCTrre - 

V-K TG, iAkC rCTGGCTCAGGTAAGTrrm \ A 

T6< IAAA lAGAOGACATTGCTCAGCTCTrGACpCGTC^ T 

TGGGGT ^TC^toAGAAGCAGQCArAAGCAACTTTATC j^p 

Or 'AA/L ^AATTGATCAAACrGCTCACTTAGCTCAIXJTCOAAGATC^ j, 
AA. IAAC rcGApCACCACCACCACCACCACTGA 

ton 35 M "homologue of, SEQ^-ID 35 

atcbmactaga^cataaaaatatct 

f(flx2AAPCAGGAGCqAAC^TTGT^ 
r >TGGTATldAAGpTGGrrCGCAT7^^ 
ATK tATPC AT p^CknrATlxkA^CTGGGTTC^ 
AT4CiT^ATCCrc^ 



CO 
GC 




ft .™.„ 

_ TATC ACASCAATCAGTCTTGj 
OTI GTTC GTITJiATGpGGAATA 
^ CA> ,GTC] GTGGOACGCG AGGTC _ _ 
Tqi GATA TGA^GCTAlCTfATGAGA 

. rrcqc5P <wCAGAq<^CMrtG^ 

Gkl TpCC AT^TCG^G<^^G^A: 

2CFE36 i "homologue of . SEQ. ID tio.. .,32" 
ATC CKJ^dTOAAAMGAAAGTAAAGTTOACTAG } 
COdCTAd^^^ 



- -jJ^TTCGCTTmATtXjrnttaT^ t 
COTK^GTCCTCTTGCCAOTCT^^ 

^E£rfA^TA]mAAAGAAATM^^ 

TTC ^5; ^^GCTTTOTCTOAGCOAAATOOCG^^ 
TTT^AT<^TTTGCAGTTTATGCGCCAG<nX]GAGTCGCCCTATACre 
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X rGCA VACCTATTTCTTGAATAATCCATTCAAGATTATCG 
£ ^^g^TAGAAAMOAAAAGC^ 



noc 

CCAl 



GTCA1 
CO 



5) • 



2CfE37i|'homologue of SEQ. ID NO. 33" 
A' PGAA 3ATT^rAAGAGGCACTtATTAAAmTTCCAT^ 
T GATGTGGIXnATTCQACCACCAGTGCTATnTAAn 
AMCGUGGlUTCTTrrGGATlGGTAGTTO^ 
T< iAQA. VATOAGCGACTAATCATTTTAGTTATATTAATAGA 
Q< JTAT TCAGTAAACOaGGCATACGOTTCKUTITCGGTO^ 
17 AAA ITCATMTTATTTGGTATTTAGCTCACCGATTCTC 
T, TTC/AGTT ITGACTCAAAATCAATGGCTTCCCCGTGCT^ 
Qj lTTG/ iAAG ^GGGAATTrTCCCTOATTTAGGAAATGCGACTATTT^^ 

2q{83Si : Li "hQ PlQjLo. gpj5 . of . SEQ. ID NO. 34" 
AT aClt MAAtTTrAACCnTAT^GGTfTTCXKSGATrATO 
£ ^ ^^ a A^ rcA G^ATTTTAGTACGTGAATTTG<XlATCGK>TATC 

V, CC TTCKK **AAJfoTAGGACTACCAGAAACGGCAGAAATTACCA^ 

JS, IH£ ^GC^GGTrAAGTCAOAmTCTATC^TGmGXAGaTGGT^^ 



tttttaoactogagga 



.( CACCACCACTGA 



ATCT^alCAt 
TTGSTTArATO 
GtfGCATCAGC 
. TTC [TAG PCTi 
CTGGCTT3G' 
AACAGGCCai 



honiologue of SEQ . ID NO J 35 " 
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840 "horaologye of SEQ.. ID NO. 36" 
G(SCTA( :CTAT[A£CTTGAACGGCTTGCATGG GTTTGTAGCAGTG ACTTGTTTCASGCCnGA^GA^A^ 

CC CCO GGAC^ACCTGTGGTGTTGGATCCTGTCnTGGTCT^ 
AC CJCl GQCAAGAGtTGATTCGXnTTTrCCXOTATdTCAGTC 
nmArcraGTCAGGAAATTAAAACCTTGGAAGACATGAAAACTG^^ 
r. Ir.rn&n GTCATTATC^AGGGAGGC^TCGTCITAGTCAGGACA^ 
HACTATCCTACiAAAATCeAGTT^^^ 

-x.— i- 'GCCAGlGACTTGATrAAAGQTGATAAACTTTTQQ^GCAGTA^^ 
2.CFE41I. "homologue of SEQ . ID NO 37" 



^ GAGCCKICA< 



TC G AQ <jA< 
GCCTClAGCA> 




orr< 



AAjCCGT 2GTA< 



GCpACltJTCA 
^ ArCCAAbA' 

K TOTOGtrcA- 

M\GA( 



^ Ai 



V0 



■TC^^CK;CGTX^GGCAGG<KK)AATTGOTKXCGTCK}TXXrrGG 
OTApCGlTCGCAAXl^7t^C(K3A 

TCAOAGAAAAACGTTTGGAGTCTACKl^CTTTTAAnATTnrtTTTr! a > , 



CTGGAGCTG 

sagaaaaacgtitggagtctagc^cttttaagat^ 

_^^CTATCATOAACAT1CTTOA<^GTAA0Aa^ 
GGATG?3ACAACCMGAGtAnCATCTC<X}AGGCAAC<nx^ 



"homologue of SEQ j ID NO. 38". 



^^ ( ^ TC0ACMGAA0TO ^^^ 

AJU k TCC( 'CTAT!gCTOAAATGGTCAA^^^ 

TO CCA1 CCAA'GAMTTOGrcttOGACOMGAAGAA<^ 

"hpmdl.o£fue-Qf .^Ep. id no;- 39" 



ATC ACCflAA' 
tTArGW5Alt< 
CcriGCA 
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AAOJ! 



GuAAA 



it 



2CfE44 "homologue of SEQ.. ID NO,. .4.0" 

AT(trTAAn'ACAGGGGCAAATG<jCCAATTAGGAACGGAACT^ 

iaatacgifgggagragatgtggctgagatggacattaccgatgc^ 
gxqptgamccgactttagtctaccacnxjtgcagcctacaccgctgttgatgca 

agttggacttcgcgatcaatgtgacggggacaaaaaatgtcgcaaaagcatctgaaaagcatg 

GtGC^CTCTAGTnATATTTCTACGGACT 

A< rTXG 1 / .TGACCQACCAGATCCACAGACAGMTATGGACGC^CTAAGCGTATGGGGGAAGAGTTA 
QJ AGG iTGTGTCTAATTTCTATATTATCCGTACTGCCTGGGTATITGGAAATrATGGC^ 
TI ACQ TGCAAAATCTTGCGAAAACTCATMGACmAACAGmTAAATGACCAGTACGGTCGTCCGAC 
TI GGA( TCGTACCTTGGCTGAGTTCATGACCTACCrAGCTOAAAA 

TG TGA> ATOATGCGACAGAAGACACAAC^TGGTATGATTTrcCAGTTCAAATTTTGAAAGA 
CC AAG' CAAGCCAGTAGATTCCAGTCAATTTCCAGCCAAAGCTAAACGTC 

CljGGCC AAAGCCAAAGCTAGTGGATTrGTTATTCCAACTrGGC/UVGATGCATTGCAAGAATITrACAAAC 
AA( JTGAG ACTCG AGCACCACCACCACCACCACTG A 



AAG 



2CFE4! 



„ It 
^ 08 



GG3. 



i: 



" homologue of SEQ „ . ID. NO. 41 V 
ATteAAXCGTTjCTCTCGACI^ 
OI GGTC GCTATCTATATAGCCGTTAOTCATGATTATCCCAATAA 
OG CGTG GATTQCCTTGGGGCTTGTGAm 
TG ACCf CCTTTCTATATATTITAGGCTTGGGACTTATGATC 
IT jCAT :AACtiGGTGCCAAAAACTCK3GTATCMTAAATGGAATrA 
GA AGA3 ATCCTATATCCT GATGT TGQC^^ 
CG CACC GTTCCGCTGGACTTTTroTTAATTT^ 
GC \CTT 3AAAf5TCACTTGGGQAC^GCTTTOGTITIT 
Gp ITTQ [TGGAAAATTATTATCCCAGTATTTGTGACT 
~ LTTA< 5CAAGGACGGAGGAGCTTTTCTTCACCAGATTGGAATGCCGAGCT 

:rra ]ctcaato:ctttgaotttgc(Xaaacaacg 

3AjGl GfGTGjfiCTTATTTTGTCAGGGATTTAATGCrt 
Ttlpf ACGGTTATTGCAGAAGATTTTGGC^ 

Q(i rrx* xx3tatgttgaagattactcttaaatcaaa:tmcgagtt 

—> iTGA' "GTTQCTCtrcCACATCTriGAGAATA 

:< :ttc|( ctttcatttcgcaagggggatgagct^ 
ah jagt racc^gactaatctagctgaagmaagagcggaaaagtccc^ 

TTV AAA< ^AAATTAAACTCGAGCACCACCACCACCACCACTGA 



55 



2CP±4« "homologue of SEQ. ID NO. 42" 

A^gaKaaatcat(^aatcacigggcks^ 
ag kpttpcaagcagtggatgcxxjacxk*^ 

Off ICTA OTACAG CACITroOGGAAGAAATCATIttTTGAAAAC^ 
""^^TGTTTrCAAATCCTGAAGAGCAAAAATGGTCT 

rGGTACrTTGAGAGMCAGTrGGCTCAGACAGMGAGATTnTT 

caggactacagcgattggtttgctgagacttggtto 

GAAGGCTAATGAAAAGGGACCAGTIGICCAAAGATG 
TA.C AAAi ^AMGAMGATTTGGGCAGCCAGGTTtnTGATAATAATG^ 
AA< TOO TATCCTlXirrGAGGQAGGTAGGCAAGATGACAGAGATtnXXi 



agoaac 

TGAG 

GCrAGTrtTCA' 
AGGM< 
miGAl 



GA 



$1 



LCfPGC 
*GCA 



homologue of SEQ* ID NO. -43' 



ZCVEii 

ATCAGAMAAAT 1 rOTrATCAATGGTCGATTACCAClCC^ 

GTC r«}T PGCCCTAAfTC^GCTATTATCTTGGCTGATGATO^ 
CGG AljGJf AGCf* "~ " 



TrcCCTTA/ 

iJAGCcAGTCTlGTCOAAATCATC^ 
£°£ a ^ 0A ^^^<rc ro ' I tCA^ 

TAC WSJ TrAT^GOAGCCTCTTAGGGCGTTTTGGTGAAGCGACAGTTGGTCTACCGGGAGGATGTGATCT 



GAT^CAIGAAjQmTCTGCTAMGATACAGGACTTCA 
TGGpAGC^(^tTAATACGAtGATTCCTGCGGrrAAA(KMAATGG 
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J Cfg Y7 (to***) 

CfcGTG lACCTQAQATTATTGATGrAGCTACTCICrTGAATAATATGGGTC 
A Ztkk rATCATCATTATTGATGGTGTTGLAAAGATT^ 
^ rX}AAK^G<!iAACATATATAT^^ 
\A ' a WCA ZCTGCjAAGGGTTTGTTGCTAAGTTGGA^ 
^ \ TmGTCGAGGAACAGTCTAATITGAAAGC^ 
C GATT TGC^(^COK:TTACCCCTCrm 
TV iOG/L l\AACGTGTAAATCATGTTTTTO^ 
f <V k! ATT] TGT^CACGGGTGGACGTGATTTACGTGGG 
IK fCAC AGTCMTGCTGGGCfj^ 
& iTTA" TCTGATATTATCGAAAAATTACGTAATT^^ 
k( ICiiqt ACCACCACGACTGA 



o 



'homologue of SEQ. ID >NO. 44" 



A.' GTCrMGAATTGAATTTTCACCATCT^ 
T1 T6A^ TGATAAAGTAGCATCTTATGATATCGATATT^ 
CI CCTT [KHT£ATTGAAGAAGTTCAA AAAA 
CC AAC< TTTTgGGTAGATCAAG 
TC GTC1 rGCrmCGTTTGATTGAT AAAATI€ATC 
A/ AC/V JCTQTTICTACAATCrrrrc 

CC TTTT jC^GGACAACGCTTTTTGGAGTCTACCTTGTATAAAATCCAAGAA 
CA ATG< TTATCaCTACaTCATFOU^ 
OC AGGV OCAC^AmATGTTATAGGTC^ 
CC OAT/ TCTGTTCTAGAGATrACGAAGA^ 

ac:aoc^ccactga 



2Cf|e49 , '"homologue of SEQ. ID {JO. . .45" 

At 3 agaI^tatggctitgaCagcaggtatcg 

CA ATTA ^VAAjAGCAGGAGCAGAGGCAGCAAAGTACCCATTTO 

TG jAAC hxxiagatgaacgcctacaaaaactaactgaaatgataactccxaaaaag 

CA rXTG ^TTTACAGATAtTGCAOCMMrrGTAAAAGGAGCTTCAAAAGGAGAAGGGCT 
TC* [TpG( X^AAtATTCGTGAAGTAGATGCGATTGTrCACGTAGT^ 
GG W 2AAGGACGTGAAGACGCCTTTGTAGATC 

tt< :rra< i^ag^aoaatcagtgaacamcgatatgcgcgtgtagaaaa 

AT, lAAG ^T^GTAGCAGAATTCAATOTTCTI^^ 
CTC :GTA( 'CArTGAATTAA(^GATGAGGAACAMAGGTTGTG^ 

ag rrcn tatgtagctaatgtggacgaggatgtggttto 

AT CGIX iAAT^JnGGAGCGAGAGAAAATCCTOA^ 
CTCfAATi^GATCATGAAGAT^ 



ATCG 



GCWCATi 



ACCAi 



Off TGG> kCTTlICAAACGTOOTATGAAGCCTCCTCAAGCAiQCTOOT^ 



AG( !TQGi ^CGCtTO<X}TGAAGAAGGAAAAGAATATATCG7TC^ 

AA'XJTCCTO3ACK^CCAO(^CCACCACCACXX3A wu«i« V uM«,in 



"homologue of SEQ. £b NO. 46" . 
jTGGAAAAAACCAATCGTATQAAT^GCto 
^TTAT^TOiAGCTttACTACGCIQATCATTAC^ 



, . 'ACTpGGACTATATTGlXrGCAGTXkG^ 

rrcjJCAC|A<^GATAGAAATmAACMCJCAm^ 

IC^Ifckl i ': '"homologue of SEQ. ID NO. . 47" 

OTG 3TATrCGTAAGCAATATCGTAAGO*AAATAAGC^^ 
^GCC^TT^GA^Tt^GA^AAMlWrCGTOJTC 
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it 



. ^ tt lCtGA rATTGCTGGCTTACGTGTGATGGTrcAGTTTGT^ 
vi> A< IAAG 2GTCAGGATATGCGAATCAfACAGGAGCGAGATTACAT^ 

G' TCC ATCATGTCGTAGTAGAATATACGGTTGATaCCATCAATGGAGCTAAGACTAT 
TC AAA* TGGTACTTTCKtfCATGAATI^ 

i ,TtTt ccaGatgagattaaoaagggactxkjaaattacagct^ 

" a: GPG' XiAAATTCGTGATOATATCCAACUA^ 
ACpfe* rTAGljjAAAGAGTOACX^^ 

ICtESl /"homologue of SEQ, ID NO. 48" 

V) ggaActtmtacacacaatgctgaaatcttc 

Kt CTGCCAGAGATraCC^AGCAGGGCGTTC 
CC GTiV GAATCTCGCCCGTACATCAGGA^ 
0 / CAA< >ATGC&CrTTCTC 

Q< GQG" 'GCATOATTGAGGAOTACTTAACGACTC^ 



ATO" 

rim 



A/j/AGTi 



s 



5v 



[TGACCCGTCAGC^GATGATGTC^ 



TC GljCA 

at tgtc gcoa { ccaaogccgacaagattcctcotggraaatggaacaagcatgaatx> 
a/ att/ aactttgacccaagtgacgatttcatccttntitc 
ctfggg|atgcaatcitagaaaaattggc 

•I : . 

i0nfe53 "homologue of SEQ.. ID NO. 49" 

^ AT Saaaacaagaaaaatccctttgcgcaagt^ 

O TC :GCA rTGTGAAGAACAAGGAAGGACAAGTCITrATO 
csv & ?ATX fGAAACTXGACAATGCAGAAGCCCTAGAGGCGAAAAAGA^ 
\£ CC\TG?AA(nXK3AAGAA^ 

AGfTTGO 3ACTTGAACTCGAGCACCACCACCACCACCACTGA 

aqrtlS4 "homologue of SEQ, ID NO. 50" 
U At 1 nTAAAACCGTCTATTGATACCTTGC^ 
IN Ak VACG rGCCrACGAATTGGMGC^GGTGCCCCAGCAACrrc 
jfcv C{J :GCT TAGAAGAAATCGAATCAGGAAACGTTACAAITCA 
^ Gjr<5COT!X5CCGTATCGAAGAAGAAAAACGCCGCAAAGAAGAAGAAGAAA^ 

TCX 7TAA U3AAAAAGAAGATGGTGAAAAA ATTCTCGAGCACCACCAGCAOCACCACTGA 



SS «. "homologue of SEQ, ID NO. 51" * 
TCAtTAACATCAjC^CAACGTGCCTrc 
KfcGA ^AAATGGAGTCAACQACCAAATCAAAACCAGCOTCCGTCAAGCTCrTGATGCGCO 
GQ" TACTCTCmCAAAACACAGATGAAAACATC 
QTGT GGATACAGTCGAAAAMTAGGACGCATCTTGATrTO 
CAA< f ATTTCTMGAAAGfCAAAGAAATCCTCGAGCACC AC 



i 



^'homologue of SEQ. ID NO.. .52" * . 

A7X)Ck?GATTOAAAATTATATAC& 

LGGGJUTCAAGGCrarTTTGOTOGAITTO 

AACGCO .GAGATGAAGCAATtKJCTACATdA 
AA( ACa AAA^CGCXJTTCAACGAGCAGTTGAGAAATT^ 

\C iTTT^TATI^CGiGTOCTATOAA 
CGWCCAC CTCATCACAGA^ 

G^CjCAAC ATGACTCAATOiAAAOTCAGATrAACCGAACrCGTGAG 
CGG^jTCGATTAGATATAAAAAAGGAATTC 

'homologue of SEQ. ID NO. 53" 
ATPlrrrCpAAAAATTTTAATTGGCAATCGfGGTG 

GCGACGGTAG^GTTTATTCAACTGCTGA 
GCApTTTbTAT^GTCCTGGCAAGGCAAG^GAGtCTTATC 
CTI^ACtpAGGfjAGAAGCTATtCACCCTCKi 
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GTIQAA( fMGTAGGTATCAAGTTTATCGGTC 
TQ CfCC TGCTCAGATGATTAAAGCAGGTGTGCCTGTTATA^ 
QA A^lGCT^GATTGTTGCTGAAAAAATTGGC^^ 
Gl AAAC GGATTCGTAAGGTTGAaAaaCCAGATC^ 

CA A<3G( caattatggcaatgGtgccatgtacatagaacgggttatctatccagctcggcacattg 
ca aatc ctagqtoatgagcatggacatgtgatk^ 

ac cam aggttttgaaagaaagtccctcgattgcaatcggaaaaacgctgcgtcat 
ct 3<7tg rtcg^gcggcagagtttgttggctatgagaatgcaggaac(^tto 
aa gtac aaatjjftctatitcatggagatgaatactogtgttc^ 

TC «3GT 3TTtt£TATCGTTAAGGAACAGATTT(^ 
AT *TTC rcCTACGCGGTCATGCC^TCGAGTGTCGTA^ 

AG TCCA GGTAiAGATTACTAATCTCTATCTGCCA AGTGGTGGAGTTGGCTTGCGCGTXKJAlTCAGCAaTlt 
AT :GAG3TT^CCATTCCGCCTTATTATGAt AGTATGATK5CCAAAATCATAGTACACGGCGAAAATCG 
TjT TGA< -GCCTTC^TGAAAATGCAACGTGCCCTCTATGAATTAGAGATTGAAGG^ 

ga rriro :agcttgatctcutttcagatcgc^ 

aa cert ^ttacctaaatatcaagaaaaagaactcgagcaccaccacgaccaccactga 

^ 2 CF ssfcv "homologue of SEQ, ID NO. 54" 
AT 2^^ACAMGTITITrATCAAGAAACAAAAGAAOT 

' " AG vcatcgatgccagctcagaacttgaoggccgtatcactgciggccaactt 

iPA3TACAATATCGAGTATATC^ 
Gp 2TTC< JAAATTACGGAGTTCCTCGAGCACCACCACCACCACCACTGA 



Si 



ATOAAG 



Ok' 
GG 



8. 



"homologue of SEQ. ID NO. 55" 
. . PATAOATATATITrAQCATTOAGACATdCtGK^ 
ACt rATG ^GCTCTTGTCCAATOTCATTGCrAGTCA^ 

'l@TAGCCAGTCGT(^CCATCTCGAGGTCATTACAG^ 

kC(:GAAGAGOACX)TOACAGCTOTTOCGGrrACCTACCOACCAOGCnTGGTCGaACKrCTTGCTAGTT 
itTTGVCAGCTOCCAAGGCCTTTGCTTGGGCTCACGGAC^^ 
pa LCCK JATGGCAGCTCAGAGTGTGGAGCCTTTXKUGT^ 
AC iCAG. VOTTCKnCTATGTTTCTOAGGCTGKjrtXJATTACA^ 
T<K (GGA XJCTTATGACAAGGTCGGTCGTGTCATCGGCTTGACCTATCCT 
CK iGprCATCA^GGGGCAGGATATTTATGATTrcCCCCGTGCCAlGATTAAGGAAGATA^ 
CC1 TCTC ACKJTTTOAAATtnGOCTlTATCAATCT^ 
JM «AGA< LGATTTOTOTGCTrcCjTCCAAGj^GCAGTTATGGACAT^ 



~ * «J AGC "AACroAAATCACAGATCXCAATGTTATCATroCA 

COiAGlk TTGCCTTTGATACCA'TGaAACTCaAGCACCACCACGACGACCACTGA UUA ^ 1 ^ ,lft ~* AA 



iCjrtjO 1 ^'homologue of SEQ. ID NO. 56" ... 
ATC ^TdQAATTCrrGGTCTTQTTGGAAACACA 

J7C \A7A CCGTGGCTAlXMTTCTGCGGGAATTITrGTOCTAGATGGTGCTO^ 

GTIJOGTC □TATraCAOAATTGTCTGCCAAGACAGCTGGTGTTGAGGGAACAACTGGTATCGOACATACTC 
JaJ^KAA^OATACGGAAA^ 

^^OCAAprGTTATGCGTAAACTCATTCAAG^ 
CCTSCTATCATJAAOTCTGT^ 

^S rc % rAA0AAAATO ^ GGAA °AAT^^QA^^^ 
GTCpGG^ACGpTATGCCACTTlCTCAGCAAGAAACCAC^^ 
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r4 



so 



C :0GA rAGTXrGTCAAQTTTTGGTCAAGGCTAATGAAATGGGAATTCCAAGCTTAACAGTGACAAACGTTf: 
CAGQTrcAACCClCTCACGTCMGCCMCTATACCATGCTtiCTT^ 
A' fCAA ^AAAGCCTATACACfcGCAAATCGCAGCCCTrGCCTTCC 
Gi }TAA rGCTiMAGCGCAAGCCTlTGACCTGGTTC^TGAATC^ 
C "CTTI CAOXGAAAGAAACCATTG A AGCC^GGTTCGTG AACTTC^ 
Q lTCG( iACGTGGTCAAGATTACTACGTAGCCATGGAAGCAAGTC^^ 
Ct .GTG ^AAGX/TITIGCXiGCAGGAGAACTCAAGCACGGAAC^TTGCCtTGA 
G' XSm GCTOrorTOTCAGATCCAG^ 

G' XJGTf SCTAAGG1XX7TCACTATCGCAGA AGAAA ATGTTGCTAA AGAT^ 
r< TAG/ .CCCrrACCTCTTCACCAATrTCAATC 
A< -CGTGG^CTX^TGTGGAT^ 

2CIB« "hbmologue of SEQ. ID..NO. 57" 
AlGATiCGTATCOAAAATCTCAGTO^ 

TC CATC GACCAACAATTACCGG<^TC^TTGGTCCAAACX3GCGCTGGGAA^ 
0< TQG< AAT^ATCCCACATWGCrrCAGGCATTTCTCGATOAC^ 

K1 rqcc tatgtcoaacaaaaaatgaatatcgactacaactttccx^tcaaggtc^ 

li 2?* T ATrtCCCTCTArrCCTCTCTTTO}AAGTTTMAGGC 

0C TTGA AATCGTCGGCCTAGCTGAGTACGCrGAACGTCAAATTAGTCAACTGTCTGGAGGTC^ 
CC GGTC rTGATTGCGAGATOTTTGGTGCAGGAAGCCGACTATAlX^TCTTGGATO 
TGACTCrOTCAGTOACKiAAATCATCATGAATACGCTGAGAGATTTGAAA^ 
CA ^GirCACCACGACCTCAGCAAOATrtrOCACTACTirXiATCAAGTtnTAOT 
ATI raCCrmWTOUACAAMOAAAC^ 

TtT (TCA. ITOG^GGTGACCTACTCGAGCACCACCACX^CCACCAGTGA ^"AMUiiAAreAACTCT 

.2C?E« ( "homologue ' of SEQ. ID NO. /.58" 

ATKXGlyVAdAAGTGAArTrAACAGGCGAAaAAGTOTC 

OA TGTT -ATTTIGTCCATAAGGCCTTGGiCTATGCTGTTQAATG^ 

^G^rTATATCATTCAC^ 

GT^r<mJOATrCTTCCATGATGTCGTGGAAGATACAGAlGC^ 

7T< X3TO rKLAraTGCGGATGATTOTT(MCGGAGTTACCAACKnTGGCAA 

CP \QCA \TTAGCGGAAAATCATCX3CAAGATGCRIATGGCCATGTCTGAGGA^ 

AG. ^AGA TTTGtCITrCCGTTATCTCAATCCAACGGAGTTTTACAAGA 

CGI X3G 1 iagcgtcaggccttggtogatgagqtagt^caaaattagagga 

AA< :OGT TGAGGAAATCTATGATCTGATTGCTATTCGTTGTAlTrrAGATA^C 
AKr^amCGTXKJATCSAATnTOGAAACCGATOCCA 
a6 CC^^GGmTCAGTCrATCXUTAOGAC^ 
AA< :CAA< toAAATGCACGAGGTGGCTOAGTACGGGOTTGCGGCTCA^ 
GGOGCA IGn'AACAGGAAGGAATOVGCTATTGGAATGAACTGGATCAA^ 
0£ GGC! t3AT(UTCCTAA(KlAATmiGGACTCT<nTAAGGAAAA^ 
^^^A™*GeTGT^^ 



AOAAG&CCA' 
lOTTTTOSGGi 



Slim! 




AOACCAC TACTCTACK^CAAATAAGGAGTATATGGCCCATATCGATA WSSS 
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dTCn}3AAc|G^TGTACTC 
C^AACGATA'tGAAGTTTGCTAATATCCATGTC^ 

"ArAAAATTAAGAGTGTGCCAGAAGmACTCTGTCVAACGGACCAACGGCCTC 
.ffiACTQA 



crc& 



ACOA| 

Am 
rrAc 



Po 



2 CPiSi 



GA5 



Ji 



: "honiologue of SEQ. ID NO. 60" 
ijA^GAAGAAATCAAAAAm^ 
OA jGCTCTTCGTATGCGTCCAGGOATC^ 
rfTGd jAAA^TTGATAACTCAATTGACGaC^ 
G ^GC& ^G ATOATTCG ATTACTCTTGTGG ATOATGGGCGTG 
G fCG^:CTGCTGTTCAGACCGT(n^ 

G 3TTK JAGGTGGTCTTCACGGGGTGGGGTCGTCAGTTGTTA^ 
P :CATE/ AAAACGGTAAGATTCATTAOCAAGAATACCGTCGTGGTCATO 
IX SGAQ aACGGArAAAACAGGAACAACTGTrCACTTCACACCGGACCCAAAAATC 
A iTCS TGATTTTGATAAATTAAATAAACGGAT^ 
C ATCi CTCATMGCGCCAAGGTTTGGAAG\AAC 

c< rrtok atatatkxaogagaacaaggatgtaatcttt^ 
<?j kTAT( awgttgagotagccatgcaatac^caa^ 
A' wire atacacatgaaggtggaacgc^tgaacaaggt^ 

Tl ATGC TXXjTAAGAATAAGITACTCAAAGACAATGAAGACAATCTAA^ 
<STAA< TGCACOTATCTC^GTrAAACACCCAAATC^ 
AJ TAGr( IGAAGTGGTCAAGATTACCAATCGCCTCTTCAGTGAAGCCTICT 
C/ CA& TTGCCAAACGTATTCTAGAAAM 
<K GTO/ AGTCACACGTAAAAAATCTGGTnXK5AAATTTC 
~ TAA^ AACCCTGCTGAAACAGAACTCTTtATC 

CGT^ ACCGTQAGTrTCAQGCTATCCHC 
<iC ATAJ GATTCTAGCTAACGAAGAAATTCGTAGTCt^ 
TXrGAT ITTTCGAAAGCCCGTTACCAAAAACTCGTmC^ 
TTCGTA XCTTCnTnAACCTTOATTTATCG^ 
CC SAAf C^CCAATCTATGGT^^ 
A* 0 A A> TCAAACTCCAAGAAGCTTTAGCCCGTTATAGTC 
TA \GG{j GCTAGGTGAAATGGACGATCATa\(OTG 

cg ctag agtt?ctqtagatgatgctgc^gaagcagataaaatctrro 
cacktcf sgtcgtgagtttatcg^^ 
aozaccactga 



^hbmologtfe of SEQ ID. NO *, 61" '.'.-* 
AlCGGAttTTACTGAAGAAA^ 



^ AGGTTACGTATT 

%Cfl 66 'libmolpgue of SEQ.. ID NQ %l .ft2" 
£™G]^CTATC(*^^ 
4 4 ^MTGTCTTrTOAAAAGATGATCAATC 



AATATAyr C GA 



AAAGAAjCfA 



ATT GTTO AAGCCTATTrrCGACAAGanrCAACGACM 

CTjT rGAA gtcaaggaaacaaaacaAaa ACGttxto^^ 

CTA^^TOCQG^ 

WATGOATATGAAATCAAGGCrGGTGCCTrccCXC^ 
lTTAGC3TGGTAAAACAAGACrcGAGCA(^CCACC^ ' . ■ 




2CFEJ7 /"liomologue pf SEQ, ID NQ.. 63*' 

II.- 1 " 
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6J 



rs AA 



DO 



ATbGCttoAmAGTAAAMAGATAAGTATATTCGMTC^ 
AA 3CTA \OCCAGAGflTTCCAQATOAATTATTTTCCCAGXtn^ 
CG^tCT JGGAAGTOAACGTATCTGTC^GCACnrCTAGCTATACCTTrt 
i XjATTGaTATGGGAACCTTCAAAGAATIXjTTTAGAGGGATTG 

taccaaaagaaactggcatctatgcgtgaaaaaacaggt^^ 

TpC rCTTATTAAAGGTCAGACTGTGGCTCTTGG^ 
GricG^CTGTAGGTGAAAAAATGACTCGlTrc 

AT QVC kGCCTCTGGTGGAGCCCGTATGCAGGAAGGAATCATGAGTCTC 
gc Igcg STTAAACGCCATTCAM^ 
GT jTCA ^GCTTCTTTCGCTATGGAAGGOGATATC^ 
GG ]CGJ :gTGTGATTGAAAATACGGTTCGTGAAAGCTTGCCTGA<jGA 
TA< ?AAC VTGGCmGTGGATGCTArTGTCAAAAGA^ 
r X3C^ 'GGAGGCSAGTCCTAGACTCGAGCACCAOCACCACCACC^ 

aC^:6? M homala^u^ of SEQ. ID NO. 64" 

Art ^(jAXtta^ 

Ca< JCtd lAGGGCTTGA AATCATCCAC^TAA ATGAAGAACA^ 
AA TGG' TGAtACTTACAAGGTGGAACGATTTGTAGTGGGCTraCCTAAAAA 
AC( :GCG< XSTAGAAGCTAGTCAAGCCTACGGACKSAAAGCTAGiA^ 
CK( IQMi JAACGCnXjACAACAGTGGCTGCTGAGGGCATGTTGATTO 



AA< j'CGC U^GAAAGTt^TTGATAAGTTAGCAGCTC^ 
TCC AGCaCCACCACCACCACCACTGA 



2 OT69 
ATGA| 

■ TA<fi 
CA' 
GG 1 



o 



Tcjr* 



AAA 



"homologue of SEQ. ID.. NO-. 65" 



Tl GAAAGATGGCGTAATCACTAACGATAACCGTATC^^ 
CAiCGAJ .CAAGGTGGACGTGCAATTCrTTTCTCn^ 

'AAA' CACHlTGCTCCTOTAGCAGCTGACTTGGCAGCAAAA 
CA<tTCGfl GGTGCTGAATTGGAAGCWGCAATCAACGCTCTT^ 
Ad CGtJ AajAAGATGTraACGGCAAOAAAGAATtrrAAAAACGATCCr^A^ 
TG/> CTTG GAGATbGTATCTTOGTAAACGATGCATTCGGTACAGCTCAaATa 
GT/ TGTC AGCAAACGTTGAAAAAGCAGTTGCTGGTTTCCTI^^ 
AGC!A<3T] GAMCTCCAGAACGTCCATTCG^ 
GTIATCC AAAACTrGCTTGAAAAAGCTGATAAAGTXXTTATCG 
AAC fCAd AGGTATCGAAATCGGTAACTCACTTGTAGAAGAAGACAAATTC 
1X1/ AAAi X3CAAATGGTAAATTGATCTTGCCAGT^ 
OA/ GTGC ;GTGACACTGAACK3TGAAGCAGm 
TCC CCA/ ATTIXSAOTAAGCTTT^ 

AA* CCti GATrTCC^AGCTGGTACAATCGGTGTGATGGACGCTAtCGTG 
A AT CATC GGTGGTGGTGACTCAGCTGCCGCAGCGATTAAC 
AG7 ACGC GTGGTGGAGCAT CAAtGGAACTTCTO 
AAAAACf] CGACkACCACCACCAOiACCACTGA 

i Crab* . "homoloctue- of .SEQ. ID NO. 66" 
ATG TTAA AATCAGaAAAACAATCACGTTATCAAATGTTA^ 



AA^CCA4TGTTITGGCTAATCITTC^ 

TATITCTOOATCKIAAAGGAATrcGTTTTAGG 
TAlfrQCAfcrAGGCM^ 
ACE A0C7 ATGTCUACTATATTtCTTGTGATAGTCTAGCT 
ATG 3TCA CnTAvJTT<k>AGTTCTGGA 

V [TTG( }AACi(ATTTGTCGCTATITPGCTTC 

It oca ^GCACCAObiccACCAOCACTaA 



'SLCF$ f V "homolojgpe.of.-S^Q.. IE> NO. 67" 
0. ATG VCAAfTCGAACTATTOACTXXICTrW 



IZAAqTTGGGATTTTAGGGQGGAATrrc 
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4 



i CTV.16 



rACGGiAACAGTTGGGA^ 

lAAA :CaTCCCTGAACACCATCGTC1^^ 
M TGA> ACCAnGAGTTGGAGCGCAAGGGTATTTC^ 
AC \'Mt( CAGATACGGATTATTACTTTATCATTO 
As> ki TTQ/ TGAAGTGGTTGACATGGTTCAGTTTG^^ 
TJ\ T<£4GTTATCTG(K3TGGACGTACCa 
AC GXC( aAA^CCGMCTTTC^ 

Gi« GCA( CACCAGCACCACCACTOA 

i . . 

aCl|E72 "hoinologue of SEQ . ID NO, 68" 
Al OAATjATTGCAAAAATAGTCAGAGAA^^ 
GC CATlirTGATGAATTTATCCAATTACATGGTGACCGrrCTT^^ 
TiA np<S rTGGfcTTGGAGACCAAGCTGTAACAGTGGTTGGTATCCAAAAAGGCAAGAG 
CC rCAA\CGGAATrrraGCCAACCACATCCAGAAOGC^^ 
TP AGA/ ATTtCGCCGTCCAGTTGTGACCTTTATCAATACAGCAGGTGCITATC 

0/ ACG1 GGTCAAGGGGAAGCTATTOCTCGC^TCTCATGdAA ATQAGTGACCTGAAAGTrCCTATTATCO 
CC fMTA rTATCGGTGAAGGTGGTTCAGGCGGGGCTCTGGCTCTAGCTGTCGGGGACCGTGT 
GC AAA^ TTCTATCTATGCCATrCTCAGTCCAGAAGGCTTTGCn^ 
CC VTpG \AGCAGCAGAACTOATGAAAATCACTrcGCATGAACTGTTAG 
TG \Trf nX>MGTAGGACTrTCTAGTAAAGAACT^ 
<M CTAf ACTTTCACAAAAACCGCTAGAAGAGTTG 
OG \QC^CCACCAO^CCACGACTGA 

2CF275 "homologue of . SEQ- ID NO. ?J" 
AT 3TCAbATAAGAn"GGCTTATT^GAGGCTX^TTTX3ATC 
XiGGCGACklAGACTTTTrGAT 

ATC< jAAaAtCCHAAACGGGGGC^AGAAAAGGCT^^ . 
TCTCAT( ^TGAATTGGTGGTCQATGTTGCAAAAAGATTGGGTOCT^ 
AT* KXsji JGGATTTGC^TATGAAGCCAGTTITGATTAC^^ 
ATJTAT] TAC^TAGTCGACCTGAACATtHCTAT^ 

GGA rArTGCCTGCTATGTTCCCGAGAGTATTTGGAGGAAAGCGCK^ , 
.(pCACX3A . 



"homologue of SEQ. ID ; NO • 72" 
AT<pAC0ATTTTCTrrGTGGTTAtCAGTO 
AO iGCTi uUAGTGAAGGAGAAAAATTAGCTGAGCAGTATGCAGGATTAGAGC^ 
Xft Tf C}fl ATGGCTTCKjAAlXrn'ATrACAGCGTrCTrGGTCGTAATAA^ 

ga' tgg p 'aaagatoaccataagatitaogtttatcagctaaatcagggta 

aa( )qqt tctaaggaaaagggagctggcgagattgacaagataacctxtggtcottatcaag^ 

aa' xttck igtagttaagtcaggatctgattt^ 

GAC rGGCJi TACftXJAGCACCACCACCACCACCACTGA 



£CFI7g "hopialogue.^f .SPQ. ID NO, 74" 
AT( TOTA CAATCGATAAAGAAAAATntHGTTTGf AAAA 
CGC C^GC ATATTCTTACTGGAAATC^GTGTTTAAACAAm 
GG< AATC TTGGTAGCCATCATTTTOATAAGT^ 
TC^KK^ GGTAAAOGACTTrAGTGTTra 

o TAA CGGl AAATyGCntjmGACGGTOTCTGGTre 
CAC TGA1 TAAqlTGGTTATCGCJTGTTTTlXjTtXj 
ATP AtGC AAGTuTTACAACGTCATCTCAAA^ 

p> CGjC AQC1 GGATTCTGGAATCTGATTTITGCCATGAGCGTAAC^ 

K GTG rGCA AXTC^GCGCTATCGTGACnTC 

Ga!a GAH OTTCCCUAAMTATCATGCCTGMTrGGTATCTG 
CAA 3CTT fATCTCATACGAAGCCTTCTTGTC^ 
GGTpGTT^ATyrCGGAmTTCACAMACGTAACAACCAATG 
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O A 
J5fc> AC$AAA 



3 ere 7? (co***) 

JVaiijrOT^OTATCCTTCTCCCTm 
R AT/iGA(?rCGAC3CACCACCACX:ACCACCACTGA 

' !l ' . ' ' 

Cp!79. "homologue of SEQ. ID NO. 75" 

.jrc iTATj LACCTATTATTAACCATTtTATTAGTATTATCTGTTGTGATTGTGATTGCAAriTrCATGGAAr^A 
^ ' U UCCMTCCAGCAATGTAmGATGCCAGrrCAGGTGATTTGTTTGAACGCAGTAAAGCTCGra 
G/AGCTGTAATGCAGCGTTTGACAGGGArrTTAGTCTTTTTCTGGCTAGCCATTGCOT 
AOpGJVj rTATCAAGTAGACTCGAGCACCACCACCACCACCACTQA 



2-CFfSO' 



o- 



hojiologue of SEQ. ID NO. 76" 
ATC ITTTC GTAGAAATAAATTATTTTTTTGGACCACAGAAATTTrACTCTTA^ 
AGAC^TGOjtUTCTntfATT^^ 
CGflCrtXTTTAnATTTGACAAACCCTAm 
T G$TAXTTAATTACCnX}TGrACTTTGGT^ 

A' , CAGmTCTAGTlT0ATrATATCTAGTCAAACTAmATAGTCOAGTA^GA^^ 
ACjTATCTAAT^ATCCTGCGCTCCAGAArrTGCATGTAGAAGCTACAAr^ 
GT GAiy .TTCTTCA^TATCCTAAATAGCGTATCAAATAGTGTCK}GGAGCGTC7TC 
TGT1 TTGAYriTGATTATaACTCC^OTTTTTTIXjGTTTAT^ 
IXCTO AAA^CGATICTAAAGAGGGAlXXCrrG^ 

,TW AGO' rrCATCCAATCACOATTTTAGTTTTACTriTXSTTOTeAAGCA 
ATI OTGC C^GTGCCAACCAATlCTATXnTGAAAGAAAm 

TAJ AATy ATG^GAACGAGAAAGAGAATTAGCTAAGCTCGAGCACCACCACCAGC^ 

&CFI81 "homologue of SEQ. ID NO. 77" . 
ATCTATdA^ACrTTATCGAAAATATAGAAG3^ 

^CCCGTGTGC^CGlTrrGAGmAAA^ 

„ ri _ rAGA AAMGAAAATATCAGTICTGAACC*GA^ 
GttjOGAATOCGGGACGCCTTC^ 

CTC CTAT nXTTGAAGAA^TTACTGGCLACCATTARrvrTA rr>A r^r^Mt^k^P^^^S^^I^^^^ 



TATA' 



}-£ TOT 3ACC 3ATCnTTGCA<P r * T * AraanarniYM-r a .i-r^m^ .^"^^^""'^ww-AiUACTOaTrT. 



AO 
GAG' 



ATC 
£ GCT 



1TCA 

CTvf, 




AGCAi 



fjCGGA^GC^G^CGAGCAC^^ 
. i "homplogue of SEQ.. ID NO; 78"' 
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j? ere $ 2. (cA**iJ 

AF CCTC AGAAAGCTITAAAAAAGTTCTAAATCCTCATCATTTr^ 
Affi IAGA ^TTGGACAAACTCTTGAGCCAGmG^ 

ATI TCOTTACXCTTACTrrcCTDTTGCGAATCAAGAAGOMCCAAGmACTA 

AA^nrCTATpvaGAAATQAGGTCGGACTCTTTG^ 
l£ S™™ ArcArcAAI ™ TGT ^ G ^ GA ^ 

M: CAG' TTAATATTTTGACTGCTOATTACAATrACf!TTrtTTf!TA /4nv^iim .V;. »»«£A£^I<GriGW 



™ TAT ™ ACWTGAmC ^™ 
Ct\C ATTCGGCTAT^ATAATCAGTTTTACGGTGGTATGAATGTA^ 
AGGrcGCTGAGGAGTATGAAAAATACTTACAAAAGTTTAATCCTCA^ 

GTAGCACTCTAGCAGAT<K:TAGTGCTCAGATGACn-GCCC^^ 
mTCCATTATCTrTATtMirfirtAArr^^ 



TAT G. 
TO/Ai 
AVTGi 
•AtCTr 



ACAGGTTTTAACTG' 

rAaixn-OATrnAAA 

ATOTGC^ATjCTTCCTCATCGCCTATGTGCTCATTT^ 

GO AATjl iCTCGAGCACCACCApCACCACCACTGA ^ TC ATrACTTCAAGAAGTTATCGCAAGATTGT 



'homologue of SEQ. ID NO. 7,9" 
'JTQG^GOACAGTTTTCTCAAGCAGGGaT^ 



5ci 

A" 



^^y" ho J^ol6g\ie of SEQ. ID NO. 80" 
A .TG A^"^* * ~~ " — * 



Q 
V 



<4 ^GFJEIS 



raOCTAACmTAGrCAATCGCfcS 



AxcpTtitTG: 



CACf 



ITCPTCT TOrTGTTAAC^CCTCCClATO^ 

.a^*at*acaaagaaacggK 

^9#CTTACTX*GGC^Ckrc 



CCAC 



homologue of SEQ.: ID NO. 81 
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(posset) 



c4 



•AAr 



canto 

^ TTtfAAG 



AAaTCA 
C/UtAC 



\ CACfc 



AAjLGrt<JGG<j( 



-AG 2CGA \GTGGATATTGAAGCGATrAGTG A AACG ACTGTTGTCAAAGCAMTCAACAQGTAGTAAAAGG 
Cff ETCCfi JMAAaAAICAATGATTTGAACGAGCCTGTGAAGACGGTC^ 
CA PGTG jTTGATGCTATTAAAAAAATAGAGGAAGA^ 
AT< JTTAi ^CATOaAAGACATGCCAGCACTATCTTAGAAG 
TT( IAAA! rCGAGGAAGCGATGAGGGAAGAAG(^GGCGCTGATGACCTTGAA^ 
AA ^GTCUGAACTAGAAGACTTGGGCTTGAAAGTTGAAACGM 
[TGGCltATCrrCAA^^ 

'AGCATCAATCTACAAATTGACACCAACGMCCAGGTCGTATTATCGGCTACC^ 
teCTTGCAACTGrtGGCrCAAAATTATCTn 
CAAl!GT0AAT<&Am 
Of TXGSC lAAGAAGGGCGCAGTCAtAAAACAGATCCAATGTCAAATAGO^ 

AF 'ATTJ CVVGGTATGGATGGCGTGACTAGTtACTCTGA AGGTGATGAGCCAAATCGCTATGTTGTTGTAG 
ArACAOAACtflGAGCACCACCACCACCACCACTGA 



AGi AG! 



^OT;8< ^homologue of SEQ.. ID NO. 82/' 

AT< 5TCA> u^TmGCCATTATTTTAGCAGCGGGTA AAGGGACnCGCATGAAATCTGATTTCCCAAAAGTm 
GCj lCM jCTTQCGGGTATTTCTAT^ 

ACi LGfA lcagttgtaggacacaaogcagaattg 

AC X&ft. ttf G^CAGTTGGGAAOTGGTGA^ 

G\( :a'CA :cttggtcattgcaggagatac 
CO TAT( aatcataaaaatotggccactatcto 
ati gttcgtaatgacaatgctgaggttcttcgtattgttgagcagaaggatgct 

K 1 '^GGAAATCAACACTGGAACaTACGT^^ 

tit lAAT^CGClCAAGGCGAATACTATATtACAGACGTCAl^ 
- *t< 'GCGuTTATACTTTGAAAGATTITGATGAA 
CTG/ lGTX^GTTATGCGTCGTCGCATCAATCAT^^ 

AGft IACTTATATCGATATTGATGTTGAGATTGCTCCGGAAGTTCAAaTCGAA 
Aa; &GC ^AACGAAAATTGGTGCTGAGACTGTTTTGACAAAd 

GG> tfCA KJAGCGGTCATTACCAATTCTATO ATTGAGGAAAGTAGTGTTGCAGACGGTOTOAC^GTCXKIT 
CCI TATC CTCACATTCGTCCAA ATTCAAGTCTC 
A<5C ! ATGT TCAATCCGTGAGAATACCAAGGCTGGTCATTTGACT^ 
AM :GTT/ ATTTlCGGrGCTCKJAACTATTAGAGTCAACTATG^ 
,GA( IaCa, tTGTCTTTOTTGGTTdAAATTCAACCATTATC 
G<t1 GCTf GTFGAACTATTACTAAAGACGTGCCAGCAGATGCTATrc 
ATA AAQi £GAATAT^AACAC<jTCTTCCTCATCAT^ 

CTC A ' ' i 

&CJPE87.. ^'homologue of SEQ. ID NO. 83" 
ATC TCCMGATTCTAGTATTTGGTCACGAAAAT^ 
CTA CCTT 3GAAAAGAAGOTACGGmGGATAOifGAAGCTGrrGCCOT 
GCC TTTG ICTTGAACtATTTTGGtGTGGAAGCACCACGTG^ 

AGC AAG1 taicttgactgaccacaatgaattccaacaa 

CGC TtjiTt 3TACFACCACC^CCGTOtXK5CTAACTTTC 

2^ ^GCGlXnTCAATCGTrTACCGTATGTTCAAAGAA^ 

CAC G1TJ GATOCTTTCAGGTTTGATT^ 

AAA ATC/ TrGCJlXXrrGAATTGGCTOAATTGGCTGGTO 

AAC CTCC TACGAAC1TOGCTAGCAMTCTGOTGAA 

caa CGGA aataatgtccgtgttgcccaagtgaacacagttgacatcgctc 
£EE ^g^^gcaatgcaagctgccaacgaatcaaacggctactctc 

ATA rcpt :AACrCAAACTCAGAAATATOGCTCTTGGTGC^ 

taJi ^gav^ck^aatacgctcgagcaccaccaccaccaccactga A1 



homolpcfue of SEQ. id NO. 84* 



49/60 



WO 01/49721 



PCT/US00/35604 



I* 



GO' *GGA IGGTCCCTATCAGTCTGCGGTTAAAAATGTTGAGGCTGACGGCCrAAAGGAGAA 
CC< flTW GCCAATGGCrTGGCAGCTTTTGAAGAGACTGACCAAGTGTCTGTCA 
GG' XJGTi JGrrf GATTGCTAGGATmAGAAGAAGGTrTGGGGMGm 
TCqAQCtjCAAtAATCGTGAAGACGA CTTGC GTATC ^^ 

GAAGCTGGAAAGTTTTATGAGATrrTGGTGGTGGAAGCAGGACAAATGAAGCTATC 
AG( :d^G' rGATCTTCGCTlTGGTCCCTTCTTGTC^ 

AA^GAApCTGAGAAGCTAGAGTTC^CCCTCGGACAAATCCCAGAAAAAAATCTGGAAG 
^GATTCAAGCrATCAAGGAGGTGCTCC^^ 



CTAGTaOATA- 



ACTGA 



Am 



I** 



^ CA4 



"hojaalogue of SEQ. ID WO. 85" 
yc TAAACGATATTAAAGACTTGATCACTC^ 
AAMtGi K3ACGGATGAGTTGCAGTTTAGCAAGAATGAAGCAAQACCIOT 
TC< CTCX AGCACCCCnTGf AGCAACACCGACn€CAGTACKrrCCTACA^ 
AOj lAG^l UjrraCAGCTCCAGCTGAAGCAAGTG^ 
GGJ lGTG( ITTTACTrGGCTGCTOGACCAGATAAACCTGCCTT 
GTC AAA( JATTQGTAATTATCGAAGCCATGAAAGTGATOAATGAAATCCCAGCTC 
AA< :GpA lATTCTCGTCTCTAACGAAGAA^ 
AA< r^q AOCACCACCACTGA 

£ CFI 90 11 homologue of SEQ. ID NO. 86" 
A7C AAA< ITAAATGGAGTAGTGGTAACAGGTTATGGAGTAACATCTCCAATCGGAAATACA^ 
TTT IGGA ATAG^TTTAGCAACTQGGAAAATCGGCATTGGTO 
ATC tGC/TAATGCGGCAGAAATCCAAGATTTTCCGTTCGATAAATACTTTC 
TIT TGAt fcACTATTCTrTATATGCCTTGTATGCA^ 
A(tC CXCT. rAAMGGGATCGTTTTGGTGTTATC^ 
TCA GQT% OITCGCCTTCATGAAAaAGGACC^ 

AAl AtG< CTTCTGGGAATGTA(kXATG€GTTTTGGTGCAAACGGTGTITC 
CTC TTCA rCAA&TGATGCGATIXK3GGATC^ 
TGC GAGC iAACAGAAGCTTCTATCACACfcTTTTGCCATCGCT 
ACA QAdf, ?ATCGAACTCGTGCTinCGATCCCATrrOA 
CAC GGA" GTTuGTTGTAGAAAGTCTTGAACACGCTGAAAAAOT 

TGC TTAC aGAA r ATACTTGTGAtGCCTACCACATGAGTTCTCCAC\T^ . 
GCC ATC/ AACTAGCCTTGGAAGAAGCTGAGATTTCTC 
CGI GAAC TCCTGKXiAATGAAAAAGGAGAAAGTGGTGCTATCXSTAGCIG 
ATC AtCA ACX^GTCrmACAGGACAm^ 
OM Gfrfy TGCGt^TAACTTT^ 

ATG TCGT rTATGGACAAGGTTraGAGAAAGAAATTCCATACGCTAT^^ 
CAC AATC CAaTTCTTGCTTrcAMCGTTGGGAG 

ZCFEU * 1 'homologue of SEQ. ID NO. 87" - , 
ATG AAC/lTCTATGATCAACTAG^GCTC 
CCT 3ATG rGGTTFCA&ACACCAAGCGT^ 
.TAATAGtCTAppGTOAOTATAAAbVAGTC 

AGC Ct$G\t GATXJGGGACTTGGAAGAAATGGCC^GCAAGAACTGAAAGAI<K3CAaGG 
A AT ATOA AGAAAAACTOAAAATrrTGCTCCTTCCAAAGGATCGAAACGATO^ 
AAT XGT 3GAGCAQCrGGTGGAGACGAAGCGGCACTm 
~ " 'QOQ AAGCXSCAAGGTTGGCGCTTTGAAGTt^TGG^ 
TflG? TCOTATGGmCAGqTC^ 

CGTGnX^CTCTGACAGAAAGCGAAGGCCGTGTTCATAC^ 
AAp TTGA ^GAGGTTGAATACGaCATTCATCC^^ 
TGC TGG^ CAGAjACGTCMTAAGGTTGCGACTC 
GAp ATGC AG<^AGAACGTAOCCAGCAGAAGAACCGCGAGAAGGCT 
GGT 3ACC WTTTCCTCAGATTCC^ 
ACP STOAGAACGGATCCGAACTTAT 



TaTji 

aactgg! 
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cr 

AAAAAl 



CC T<X* AAAACTAGATACGATTTTGTCTGGTAAATTGGACGAAGTTGTC 
kJ Ap|< ^A^GTAGAAGAATTAAACAAACTCGAGCACCACCACCACCACCACTGA 

^CIBW. L "hrjmologue of SEq. ID NO. 88" 
AT Gtt&TACACTCnAAAGCTGAAGAAGTfcGG 
AC ACT! ACGGAATTGAATACCAAGACGAGATTATCATCGTCGATC 
CT TGCT rGGTATCGACTATGTCATTCCTGACT^ 

TI ITAA rCAeACACGGACACGAGGACC^CATTGGTGGGATrCCGTTCCTACT 
TA TTTA rGCIXJGACCGCriTGCCTTGGCTTTGATCCGT^ 
QC CAA/ GTTrACGAAATCAACCACAACACCGAGT^ 
CG ACTC \CTCTATTCCAGAGCCTTTGGGGATTGTC^ 
(iA CtTT \AGTTCGACTTTACTTOA 
AA GGOC TGCTj^TCTC^^ 

CG TOgrCAGTCCATTATGAAGArTATCCAAGGTAITGAAGGA 
A 11 ATCT nCCGTCTCCAGCAGGCAACAGAAGCTGCTGTTAAjGACT 
TIF ^AT /GAAAAGGCCATTGTCAACGGAATCGATCTTGGC^^ 
OA OCp AATGAMTCAAAGATTATCCnX3CAGGAGAAGTtCTTATCCriCT 
^|VrG6:*GCpCTCTCTC^ 

HCR< TCTTCTAGTCCCATCOTGGAAACACTACT 
GTG rCGAXGmiXXACXK5TAAAGTGAACAATATCCATAC^ 
C rCATGCTCCGCTTGATTAAGCEAAAA 

rCCA sgctggactagcagtggatactggtgttgtgaaggacaatatctt^^ 

GT 3CTT< JCCCTTACIGCTGACTCAGCTCGTATCCK^AGGTCATTTCAAC 
AA \TCG rATGGGTGAAATTGGCGCAGCtGTCCTCAAAGATCGTCGCGATCTATC 
Cp IGCA JTCGCAACTGTTGACXICAMT^ 
TT< JTCTV CAXGAGAGAGTCIGGAGACTTOA^^ 

GG VCTCJ laaaataaggatgctagcgtgcaatctgtcaa ™ - ^—.-ii^ 1 ;^^ 

TC" 'ATG/ lAAATACCGAACX/TGAACCGATCATCATCCCG; 

ccacca< x^cgaccactga 

ATd^ACGC&MCAA^ 

AA< 3GCC ^cucgattgaaaaatatc^ 

GA' TTGi AGAMTCCAGTaTGTCCGTCQATATTGAAAATAATTATGAACCGC^ 

GAi lAAG 3CCCTCTTATCAATGACnTGAAAAAAGAAG^ 

AC( ICGGi lCCGTOAAGGAGAAGCGATTTCTTGGCaTITGGCCC^TATO 

C^CCGrOTGCn^^ 

GAlAtCK fACTTCGTCGATCCCC^CAAG^ 

CTt TJTT 3TGGjiAGAAGGTCAAGAACKiGCTTGlt^GCAGGTCGCGTl^ 
CA1 TGAC CGTGAAAATOAAATCAATGCCTTC^ 

n AA G^OTCAAGGaAQTCTTGTCTCGTCTGACGAGTAAA^^ 
o GAC ICGTV AGOGC^TXKnXXTTTACCCTATA 

tt ^^n-ACfKJATCGTTTTGGTAGCMGTAT^^ 

GG^CAA< iGAT^GCTCAAGCTATATACCGTrArcro 
CTmGATAC^TGGCTGT^^ 

ATGrGGTiLUA&GGTC^ 

AATUr ^TtoTTGAATAmCCCAGATATCGTAAACGTO^ 

AAGTGGGAAAAGAGCAGTGGCGACGGGTCATTGATOCCT^ 



AATMGi 
TOG ATO. 
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v4 



ac aaq: TGCGAAGGCTGAAGAAGAAATGQAAAAAATCCAGATTAAGQ^ 

TG kkQI gtgtggtagtccaatggtcattaaacttggtcgttttggtaaattc 

o CA GAT1 GCCATCATACCCAAGCAaTCGTGAAAGAGATTGGTGTTG 

^ aa ttat ixjagCgaaaaaccaagcgtaatcGcctattctatggttgc^ 
^ ac ctct rgggkcmgcctgttggtcxitgactgtccaaaatgtc 
^ qt agt^gtggjcmwaggttgttgtagcamggcgactacgaggaagaaaagato 
gc rcpjji gcac|accaccaccaccactga 

2CFS95 "homologue of SEQ. ID NO. 91" 
AT jTTTlTTTCjCATCAGTGCTGGAATTGTGACATTrrrACT 

At rTTA* 'AGA^GGCGCAAATTACAGGCCAGC^GATGCATGAGGATGTCAAACaGCATC^ 

TG 3GAC rCCTACAATGGGAGGTTTGGTTrTCT 

AD rAGC 3AATTCAGCAATAATGTGGGAATGATTTTGTTC 

AG VTQA cmCTCAAGGTCTTTCGTAAAATCAATO 

CT X^AOGTGGAGTTATCTTCTATCTTTTCTATGAGTO 
^ GT rCAT [TOGCSATTTTTCTATATriTCTTOGCT 
^ AC ^GACGGTGITCACCKJTTTaGC 
. Tpra^iGGTCAGATGGATATTCTreTAGTGATO 
jy* TA IGCA rAAGCCTGCCAAGGTCTrTATGGGTOATQTGGGAAGTTKK3 
l ^ Air 7TCT, lTOGCTCTCCACCaGGAATC^ 

tc' tat< jatgcaagtcagttatttcaaactoacaggtggta^ 
ac ?atftgagcttgggggattgtctqgtaaaggaaatcctro 

TT< fGGG IGrrdGTCTTGTAGCAAGTCTCCTGACCCTAGC^ 

aci:acc^ctga| 

* I 

SLdnm "homologue of SEQ. ID NO.^ 92" * . * 

AT< iGCAfcGCGA AnTTCACTTG AAAAAACTCGTAATATCGGTATCL\TGGCTC^CGTCGATGCCGGTAAAA 
cL^CAA n-ACTGAGCGTATTCTrTACTACACTGGTAAAATCC^ 
GT< !ACA> ^TGGACTGGATGGAGCAAGAGCAAGAACGTGGTATCACGATCACAT 
TC/ ATG< lAACAACCACgGCGTAAACATCATCGACACACCAGGACACGTO 
* AOiJTTCICTrCGTGTATTGC^TGGTGCGGmCTO 
AX lGAG ETTOGCGTCAAGCAACTOAGTACGGAGTTCC^CGTATCGT^ 

c<5< rrcq ' xuc^rccmACTt^ 

TXfc C/^Afi CGGTlTCTGAAGATGACTTCCGTCGTATC 

TAJ CGA< iotgptacggatatccttcaag^ 

CGI GAA) lAATTGATTGAAGCAGTTGCTGAAACTGACGAAGAaTTGATXJATO 
Gjjj uATfi .CTAACGMGAATTGAAAGCTCGTATOCGrAAAGCGACT^ 
TGI GTGC TTCAGCCTTCAAAAACAAAGGTGTI^ 
(X> CTTC ACAT^CAGCAATC^ 
ACC AAqJ LGCCATTTGCAGtnrTTGCCrrC^ 
Cqi GTTT teTCAGdTGTTOriTC^^ 
f <^ CGC AQGf] ATCdTTCAAATGCACGCTAACAGCCGT^ 
It GCIGCCf! TTGGfjnTTOAAAGATACTACA^ 

AQ1 CAAfl CUACQTTCC&GAACCAfiTTATGCAACT 

GAI GGG1 ATdGCCCTTCAAAAATTC 

GGt GAA> >CAGmiCTCA<WTATG<kH^ 

CAlA AGn GAAqpGMCGTAGdTGCTCCTCAAOT 

CGC QOAl TCITGAAACGTGAGTCTGGTGGTAAAGGTX^ATTC 

ACG AAG-/ AGGTjAaAGGATTCXJAATTCGAA^ 

AGC GdTT SAAA^AGGTTTGGTAGAATCTATG 

AAA GGT^ AGCmATOATCGtlpATATCAa^ 

ACT ITGCi 2TTAAAGAAGCTGClftAATX!AGCACAACCAGCT 

ACT 3TTC lAGAAGAAAACCTTGGTGATGmTGGGTCACXiTAACTaCT^ 

TGG \AGC ACACGGTAACAGCCaAATCGTTCGTGCTTACOW 

AGT rCTf( XSTTCiKKjATCfCVVAGGACGTGGTAC^ . 
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CT< JAAA 



'A 



c 

OA 



. * . ^ A^C\dTACAAGAGAAATTATTAAGAAAAATAAAGGTG 

^ „r ' . . 

' jj i ' 
2.CFE97 , "homologue of SEQ. ID NO. 93" 
AT }dCA AATrACAATATTCCATTTTCACCGC^ 

cc rrbc 3TTc^ttggatcacaacaggtcctaaaacaaaaga^ 
AC \gac acct^gactgtttgtctcaacictgcgac^ 
tg jgAcctggtgatgaagtcatcgttccagccatgacctatacggcttcatgtagw 
og 3 agc mgocctgtcatggtggatatccmgcagatacgittgagatggact 
cc f atc ictoaqaaaacfaaggtgattatcccagtagagctcgcagggato 
tc rtcglagtcgtcgagaaa^^ 
ta ftgt< jatt^tctctgatagt gktccacgcitrgg 

CT jA^T TACTtTCCTTCTCATTCCATGCTG 
AA 4X$CC MTCCAGTGATTGATGACGAAGAGATGTACAAGGA 
J**. CT, iAGC ITtSCTCrTGCCAAGATGCAACTGGGGTCATGGGAA 
|> a \CAt SACXXJATAICATGGC^^ 

^ G)r. \AGQ \CArtGTGGACCGCTATGATAGTGGTTTrGCAGGTTC 

to iaac rcitxjAATciTfx^aKx^ccrcT 
cr< :atc/ tcwagaattggctaaagcaggaattgcaagtaatc 

CA< 3CCT LTAAGAATCTTGGATTItJATATGACGAACTATCCrAA 

. AC< XHtXJCIXTITC^^ 



^GTGCTAACTTTATCAAAAAAACTCXjAGCaCCACCACCACCACXJACTG 



"homologue of SEQ. ID NO. 95" 
AT<&7TTffATACTTATTrcCGTGGATTA 
Ak' "KCTl AaMTTCCTAaTCAAGATGAAAAT^^^ 
CTC rTTT^TATGGCCTTTGGGACCSAAGCCAAAACAG^ 
CCC rTATC STTTGGTTGGTGGATTCGTAT^ 

CO XCAJ ATATCCTATCAACGTTCTCAAAAAAAGTGACCGCTCTC^ 
Cj)l< TCA> ACGATGTCAAGGGGGGCGCAGCACTGATTGC 
ACC TAO/ CCGGTCCCATGACTTTGAAGGGCTTGAT^ 
OW XCtil TATC1TCAGATATO\AGaAAATGAAT(U^ 
* \( fAATf XXIAAjaSTCTCGACGAAGAAA 
w J- TTAjI CCGCATCCCTGCOCT^TrcTTCCTATTA^ 
CAi tfCtf CATCTOGAACCCAGATAAGAAAAGAGMGAACTTGC^ 



ZCWSIOI. V "homologue of SEQ. ID NO. 9*7" 

atc ACc^0G4AtrrrtACATm 

CAC CtCC rTTGACAGAAGAAGAATTGGAATCTATC^ 
TACjAGAl ATCTlATCTOCXXJrTaGCniA 
* * "ICfc f AATTflTTCCTCCAACGTGAA^ 

lAJ AATd^CAACCAGTOGCCTACrrcAAATCCTACTGTC^ 
AQlfTGOl rACAACrGATGGTTTTCrCTATCCCAATCAAACCT 
OQA TTITC CTGAAAGCTATGATAlXKiAAGClCTrcTCAAC^ 
TAG ATA! rCCTGTXH'ATTCTCATGAAGTTrACGACATCG 
tga mj jTaAT ^GTTOAGGGAATCAATO 



I HTITpCATtf ATGTA^^ 
AAA AfGC TGAQTrCTAGGCGlAAACGAaicro 
AAG TGG/ AGCOTTTCOttX^ 
AAjC C3AG^ AATOITQCAG AAGTCATTXHTCATAAAAG 
AA^CTqGAGG^ . 



Oi' /'homologue of SEQ. ID NO. 98" 
ATGpAAAjTTTG^TTATTAACAGATG 
TCA ^^GjVGCTj^GACGTAC^TCATTATTCT 
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3 Cf€ /&2- 



Mi 



TA 3TGA AATGGCGGTCACAGACCTGGGTCTAGCTO 
OG FGA/> TGGT^CGCCCATTACCTAGAAATTGAAAATCAA 
AC \GA£ GCArGGGAACTACTTTGGAAGTCCTTGCT^ 

OA ItCG CGTATCGGCTTGATTCGTGG AGAAGAATACCATCAGTTGACGAGCGATGATTCCrnGGTrMTG 
AA TXOk. TCAACK5CTGGTCAATTGACACX?AGAAGAGGCAGAA<jCTC . 
AqTCrA rTGG<JCAAAAAGATGAAATT(l\GCCTGATnTGGGACAGTTATCC^ 
jUATAGTCACGGCTTGACCAACATGATT^ 

AGCJJAGATAAAACGGAGACACTTGTTCGTTITGC 
:CCnT|!GTTTGTATGAACGAGGAGGATGAAG 



CTKjCTJA 

ATrcpf rL 
cg 3ttg:c 



fiCFi:io[4 

AW 

GA' TA< 
AAG' 
TCACti 



GG1 CTt< 
if \ AQC AC 



CFS103||>hom6Xoque. of SEQ. ID NO. 99" 
ATfiACGATACiAGATOAAGAATA^G GTAA ACGAATTGAT^ 

TT rTGTATGAATTGCGAGATCGTTTOAAGAGAAATCAGTTTATACTCAATO 
Tr<fTCAmCCACTGGCGGGGATGGTATGCTC 

TATCGGTCTTX^TaCTGGACATTTGGGCTTC 
rJpACTAATTTGCAACTAGATACTGGM 
A VTGGTGAAGTTAAGATTTTCAGAGCACTCAACGAAGC 
Gtj TCJg4 ^GATATTGTAATAA ATGGTGTTCX^CTTTGAACGTTTTCGTC 
CG( 3ACt< jGTAGTACTGCCTATAACAAGTCIOT 
AT 'AAQ JGAOaTTGOCAGCCTTaaTAATC^^ 
AG 3aTaVGATTGAACTTATTCCAACAAGAAACGATTATCaTACTATTTC 
TfT X:GTV ATAtTGAGCGTATTGAGTATCAAATCGACCATCATAAGATTCAGTTTGTC 
TA< ICAG" TTCTGQAACCGTOTTAAGGATGCCTTTATC^ 
CA<rACX3A , 



( "homologpe dcf SEQ. ID NO. 100" 



TAC<!A 
TTA<5T 



TGIAi 

TGCCCTdUl 



rACTCGTOCAGCTCTTGAAGMGGra 
GTG ATTf C^GCltJlTCOTACCrTGGAAlTO 
CTTjTGGA AGAACCXXfTTCGTCAAATTCCT^ 



IG^AATTAAATmCATCAGATGCCCGlTC 

^aaachaaccttgggaccaaaaggtcgcaatctcgttcttgaaaaot^ 

jatgacx3gtgtgaccattgccaaagaaatcgaattggaao 
atcagaagtagcttctaaaaccaatgatatcgcaggtx3acggg^ 
gctatcgtccgtgaaggaatcaaaaacgtcacagcaggtgcaaatc 

TGtfGATtTGAAACAGCAGTTGCCGC^ 

"~ ATC^CAAGTTGCAGCCGTATCTTCTCGTTCTO 
AAGftTOGCAAAGACGGTGTCATC^ 
XCfAAGOAATOCAGrrrGACCGTGGrrACGTTTCA 

rCTTGAAAATCCGTACATTTTGATTACAGACAA^ 
y f OAAAGCATTCTCCAAAGCAATCGTCCACTCTTGAT^ 
iA CTCTTCnTTTGAACAAGATTGGTGGAACXJITCAA 
TQ/ CCGTCGCAAA(XXATGCTTGAAGATATCGa:ATCTTAACA 
^ """"t! AGTTpAMGATGQ3ACMATTGAAGCTCTrTCTCAA 

:( nTATlXjTAGAAGGTGCAGOAAATCCrGAAGCGATTTCT 
v AACtACAACITCTGAATrrGACCGTGAAAAATTC 
^mTTAAGOTTCGA^^ 
* ' CGjpTA " ~~ 



jG]|JAQCAGTCXjTAGCCAATAAACCAGAA(^GTAGCCCCAG 

IGjj oatgatgctcgaGcaccaccaccaccaccactga ^ 

^CFfetOS "homologue of SEQ, : ID NO.. 101" 
ATCmAAGATTGAAACCGTATTAGATATTTC^ 
— XAC^aCAACTACAGCAAAGTTATTTI^ 

* - . T 3TTT^TTTrGCAAAAOQCQCTGCCTTTAAAAAAGAATACCTT^ 

TAGpTTqjnrAT^TAGCTGAAAAGGACTACQAAGTCQATATCCCTO^ 



CAA 

tog no 
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CM 



M'VfcGG 



Ta:TQT(»ACtoTGAACACMCT^^ 
^AjTGA£CTCTtTGACATGATGAATCAGGCTGTG-CAA^ 

CTAGTCCATCGAGTCTATGGACTGACCTTrGATGTAGGAGTCmOTAAC 



te^AGATGCAGTTAmTTCCGGGT^^ 
k^ACCCAGGAGA^^ 

^(TAGACCTCAAACGCTATATCAAGGAACATTATCCTCTtK^ 
rGACi '^'^ICTCrrOGCTATGCGCTTGGCCCATACTCGA 

R ia ^A(^ccacaagcacx:6aaa^a^ 

gaaoctaatatagaagttotag^ 
Jcaa/gtccixagagcaggtoaaaaggxagacctaaataa 

"homplogue Of SEO. ifc wn mv 



GG 



tro] 



CCT 



TTA' 



107 ^"hoinologue of SEQ 



ACTTAAG 



«22r 9Ue SEQ - it> no. 103" 
lcgttgcaatotgaccaa^tcgta^ag 

WGJTCGAAGACCACGGA^^ 
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3 



2 cf: iioa 



3 



"homologue of SEQ. ID NO. 1 04" 



^ ^TrtGGACGMCraf^^ 

£""72 TCAGGCCTATGTOGAGCATGTTCTTCGTGAAAAG^raC^ 



GGTACTAAAACTCGAGCACCAOC 



2-CFi;i0> "homologue of SEQ. ID NO. 1 05"- • 

JAGAAGAACAACt 



IK 



^CFElli 'homologue of SEQ. ID NO. 1 07 11 



T*™ACTmAVttAX^^?£^ 
AA/ CGC1 TGAQC^GAAACaiCIOTr^n^ 
GCIATG/ G^^tC^^ 
AAC ACG( AACAGACTnTITJATG^AT^ 



V AGCCTACTTGACCXAATCttCKm^GC^^ 
X TOAGTfccrcATACTTCACAttft^^ 
> r$ ^ OCCATfCAAACOTATO^ 



GACrnxji 

ram a* at 



n^^AGCCATTCAMCGTATX^COTACGAT^ 



GACTACOAGCATCTTQAGC^TGGTGATOACTTTG 



^rccfraA^GAm^A^ 



AT(^^ACT|jG^^2^c^^cS ^^^^^T^CGTATGTTOCAaGT 
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h: 



aCPltlii "hoznologue of SEJQ. ID NO. 10.8" 
AT( rTCT( ?AAAMmC5TAGAAATCAAAGATTTAGAAATTT^^ 
CG< iTTA UUATGCTAACTTCTTTAT 

C3A- LAA(? M,CTATTGGTCGTGC^ATCATCGGTCTAAATGATACAAGTAATO 

CA lAAG vttaAtggtmgamtcgcgtgaacaagctgcggaattgattottcgaatc 

AAi ?ACC ;TCC^CAAGTTTGAATGA^ 
C& TTA1 TTAAjbGATGAAGAAGAACGTAA AGAX3 
CT( AGGiCTTGACTCGTTACCCTCATGAATTCTCAGG 
TTC GTC/ TGCAACCAGACTTTGTTATTGCAGATGAGCCAATnCAGCCTTG^ 
.AG' dTC AACnTGCTCAAAAAATTCCAAAAAGAGCTCGGCTTGACCT 
C6( ITTCT TCGCTTTATTTCAGATCGTATCGCaGTTATTTACAAGGG 
GA. ^GAA rTGTTTAACAATCCAAlTCACGCATA 

M 1 CTTC GAACbTAAGAAGGTGTTGAAGGmACGACCCAAGTCAACACGA 
AN TAT< GTAGAAATCCGTCCAGGTCACTATGTTTGGGCGAACCAAGCTGAATTAGCACGTTATC^ 
GGj iCTAi LACCtCGiVGCACCACCACCACCACCACTGA . ' 

2CFl4 j "hbmologue of SEo. ID^-NO, 109" 

AT( rAAGlTAAGTTGGAATGGATTTTCTAAAAAA 

GCGCTCC TTAGTC(nXJAGAGAGAAGCTAGtCTGGAGAAGGATGAACAG . 
CK( JCTG> .GTGAGAATGTGGTGGOAACTtTTTCTCTGCC^ 
TCi? GGAJ lTACACCGTTOCCTATGTGACAGAAGAACCCTGTOTOGTTGCG^ 
ATC AlfiA AGCGTGCAGOTGGTTTTACTGC^^ 
ATC AAGj TGCTiAATCXTTAAACrrAGCGC^ 
CCV ATC> AGCCTATCCTTCn^TCGTrA^ 
A*( rCGA lCCA&ACTTTCTCGTTGOT 

AA< ACQ .TGCTGGMGCCTIXUAACCAGTCTTAGAAGAACTX^ 
CTC TCCA ACTAtGOTACTGATTCtCTG 

AJP .GGA' X^AAdkjACOAGAGATrGCGGAGAAAATTGCGTTOGCTAGTC^ 
CCC AGC1 GQTACTCATAATAAAGGAATTmAATGGTATTGATC 
GG( GTGC CATCGAAGCTGGGGCCCATGCCTTTGCCAGTCGAGAT 
OAC GCVf GAtX^TTGAAAGAGAAGAATTGGTCGGlX3AGATGACCCTGCCG\TC 
TGC CiQJ \TCgfpCCTCAACCCACGTGTAGCTCTCAGTCATQATCTACT^ 
TAC CCcj* GA^TCGTGTCXIATCGGTCT^^ 
ATCCAGCAAGGCCACATGAAACTACAGGCOi^^ 
TTG CTCO :CTAGTAGAGCC^CTa\TCTC 

AA/ TTTA AGATCAGCOGCCGCACTCGAGCACCACCVC^CCACCACrGA 

2CFE114 , "^omologue J of SEQ, ID -NO. 1 10" 
ATG CCAA TTAdiTOOTAGAAATAAAGGACAASACTT^ 
AAC TCGA TGAXTTmAGATATtGTGGfrrCGTGATTACG 
TIT 30GT VTTAAGAGrn , AGAAGAGCGTTTGTCTrACTTTGATGAAA 
TAT rGAT rGCT<^GGATACAGCTGAGAGAGTGAAACAGGCGGCG<^ 
ATC fcAGC AGAGGAAGATGCGCAAC^KTnXjTTGGAAGAAGCT 
AAC CAA( TGATAATGCTAaGAAAGTCGCTGTTG^ 
ACC K^CC TCTCAAATCTACAATroAGAGTCAGTTGGCTATTGTrc 
U^- CGT :CA* ^GCTACTTATCTTCAAAC^GTGATGAAGCCm 
AAC COAT rCCAGCrcCAATTGAAQAAGAACCAATTCATATO 
AQA ATTjV CAAGGTCGTATTGAGGTAGCCGATAAAGAATTCTCTGAA 
AGT 3GA^ ACTCjCAAGTCCTGTAGTGAGTXXTCAAGTT 
TGI VT0i| \GAA|CCAGAAGCTCGAGCACCACGACCACCACCACTG A 

^.StGFEiiS ''homologue of SEQ. .ID NO. 111" 
rGTAXTGGACAACTAATTTAfTTAAtGGTTGCAATra 
rTTTCTACdtAAGCGAAACGAGGGGAGAm 

pTA^TGATGAAGTAGAAGCTGTAAAAAATATCCACTOAtTGGACA 
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CcjjtGj^TGGAATCAAAAATGGGTCGATTTATCTCTCAACTCr^ 

AA 3CAfi MGGCTATAACCATTCATTTCGTTTTCTCAAGGCCAGTCATCAMTTGAGCAAATTGAGAGTCA 
AATTAtfrTTGATTGAAGAAGATATO 

AA UAT \GTGGTCGTGTTCTTCATGCTrTGGAmATTTGAGGAACTTCAGCATAGAGTTGCTGAAAATTC 
AG UC^ 3TATGGTCAAGCCTTGGATGAAATTGAAAAACAATTAGAAAATATCCAATCTGAATTTTCACAA 
Ttl GlAJ lCCTTGaATTCATCGGGTGACCCTGTGGAAGCCGCAGTGAT^ 

TTi TGOi :CTrAAGTCATATTGTGGATCGTGTTCGAGCCTTGGTTACGACGCmCTACAGAATTGCCAGAT 
CA> ITTAi ^GGATITGGAAGCCGGTTATCGTAAACTAATTGATGCTAATTATCATrnXjTTGAAACGGATAT 
TG; Ia!G<S 3CGTTTCCACTTGCITrATGAAGCATTCAAGAAAAACCAAGAGAATATTCGTCAGTTGGAATTG 
OA TAAT JCCGAATATGAGAATGGACAGGC^CAAGAGaUATCAATGCCTTGTATGATATm 
AA, KVTQ nTGCTCAGAAAGTAGTGGAAAATCTACnTGCAACTCTrCCAACTTACCTTCAACATATGAAAGA 
GA VTAA rACTTrATTGGGAGAAGATATTGCACGrTTGAACAAGACCTATTTACTirCTGAGACAGCTGCA 
AG< X1AT jTTCGTCGTATTCAGACAGAATTAGAGAGTTTTaAGGCAGCTATTGTTGAGGTAACTTCAAATC 
A\( JAAG AACCAACCCAAGCTTATTGAGTTCTTGAAGAAAATCTTGAGGATTT^ 
At j ITKL .AGATGAGCAAATTTCAGTTAGTGAGCGCCTGACACVAATTGAGAAAGATGATATTAATGCAC 
_<^GR AAA VGGCCAATGTTTATGTCAATCGIXnXJCATACTATC^AGCGATACATGGAAAAACGCAATCTGCC 
hj; AG( JTAT "CCACAAACrriTCrrrGAAGTTATTCTTrACGGCAAGCAATAATACCGAGGAT^ 

TA< !A!Ad IAAAAATGATTAACATTGAATCTGTTACCCGAGTTCITGAaATTGCAACGAATGATATGGAAGC 
TTI AGA/ ACGGAAACTTATAATATTGTACAATATGCAACTTTGACAGAGCA^ 
CG< TATC IGCTqATTIGATGAACXJCATTCAAGAAGCATTTAACGAA 
TTC ATT/ TCACGCTTC^TTTCACAAGATrT^ 
CTI TGTI ^CCTCATATGAGAAAACACGTGAAACXJATTCGTTT^^ 
CACCACGA || 

2.CF1I116 ."homologue-af-.5EQ. ID NO. 1 1 2" 
ATC iC$T/ TATGTTATAAATGGTTAAaaGAATTGGTGGACATTGATGTGCCATCACAAGAGTTGGCTOAAA 
AAi iTDT< AACTACAGGAATCGAGGTAaAGGGt^TCGAATCACCAGCTGCTGGTCTCTCAAAAATTGTCGr 
CG( ITOA<iGrcTTGttOTCCGAAGATGTG<XAGAGACTCAa 
OA/ iGA6 XiTCAQATXXnTTOTGGTGCm^AATGTG^ 
GA( icxq< ITATCGCTGATAACTACiAAAATCAAAAAAGGAAAAATCCGTGGTO 

TCI GTTC VCTTXJGTGAATTGGGAATTTCTGACTCAGTTGTGCCTAAGGAATTCGCAGATGGCATCCA^ 
TIG CQTC AAGATaCCGTGCC^GGTGAGGAAGTCTTTTGTTACCrAGACrTGGATGATGAAATC^ 
TTT CCAT ^(^C€AAAC^GTGCAGATGCCCrTTCTATGTGTGGAGTGGCTCACGAAGTGGCAGCCATCrA 
TG/ CAA( rGCAQTGAACTTTAAAGAATTTACTCTAACAGAAA^ 

GTC AGd/ TOAGACAOAGAAGGCGCCrTACTATGCAGCTCGTATCTiXKS 

gtc CACk atgg;Iitck:aaaaccttctcatgaacgaaggaatccgtcccatcaataacgtagtggacgtgac 

Ca/ CTAC ATCCTGCTCTATITTGCTOWCC^ 
. TGC GT07 GCGTOAAGCGiXJTCCTGGTGAAAAATTGGTGAaTrr^ 

GA/ «3A( ICTAGTCATCACTGTCGCAaACAAGCCAGTAGCCCTTGCAGGTGTCATG^ 
^ GV ATClCTO^AAAATCTAGTCGTGTTOTCCTTGAAGCrc 
t? . AAC TGG" CGCCTGMCCITCGTTCTCAGTCATCTTCTCGCTTTGAA^ 
AAI GAAC ICCCTTGATGCGGCAGCTAGCXnXjATTGCGGAACT^ 
GTJ rCAG CGGQTGAGCTTGATAGTTCAGATGTAGAAGTTTCTrCAACCCrrc 
CqC MCI GAGCTOTXTrATGCTGATGTAGAAGAOTTCTIXX^C 
CAC ACAC CTITACAGTGAGAGTCCCACGTCGT^ 
AA1 rGCT IXTOVTCTAIXKmATGACEGCTTOC^ 
TTG ^CAG CCACACAAAMCTCCGCCOTCAAGTTCGTACHATTG^^ 
TCA 2CTA rACTCTAACAACTCCTGAAAAAGCAGTTGAGrrTACGGCTCAACCAAGTAACOT 
CAT 3TGG CCAATGACTOTGGATCGTTCAGTCCrcraSTCAAA^ 
CCT V.CAA CGTGtjCTCGTAAGAATAAAAACTTGGCCCTITACCAGATIWAAA^ 
TA A rCCA 4AACUAGMCntCAAATCAAATCAAC AGTmG(OT 
. AAA GATI rCGAAACAGCAGCAGTTCCAGTTGATTTeTTCTAra 

TCG [TTCK iGAC'pDCAAGTAACCTATACAGCAACATCTGAAATCGCTA GCCTTCATCCAGOTCGTACA GCC 
. GTG ^TTT 'ACTOGKjTGACCAAGTTCTTGGTTTCXnTGGCCAAGTGGATCCAGTC 

TAT /CCA< 5AAACGTATGTGGCTGAGCTTMCCTITCAGCTATCGAAGCTGCGCTTCAGCCAG 
TTTfTAG^T^a^MTTCqSGXjCAGT^GCCGTGAOT 
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^ 0/ TAGC m^CGQACGAAGAAGTCGC^CG^ 

2 ^™i!?.J'^ Oli,ol0 5ru e of SEQ. ID NO 113" 



CApCACrOA 



II 



^TCTCOAGCAGCACCAOCAC 
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SEQUENCE LISTING 

<110> Dougherty, Thomas J. 
Pucci, Michael J. 
Dougherty, Brian A. 
Davison, Daniel B. 
Bruccoleri, Robert E. 
Thanassi, Jane A. 

<120> NOVEL BACTERIAL GENES AND PROTEINS THAT ARE ESSENTIAL 
FOR CELL VIABILITY AND THEIR USES 

<130> 30436. 44USUX 

<14 0> Not yet known 
<141> 2000-12-30 

<160> 226 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 708 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 1 . 
atgatttatg caggaattct tgccggtgga actggcacac gcatgggaat cagtaacttg 60 
ccaaaacaat ttttagagct aggtgatcga cctattttga ttcatacaat tgaaaaattt 120 
gtcttggaac caagtattga aaaaattgta gttggggttc atggagactg gg.ttttacat 180 
gcagaagatc ttgtagataa atatcttcct cttcataagg aacgtattat cattacaaag 24 0 
ggtggtgctg accgcaatac aagtattgag aacatcattg aagccattga tgcttatcgc 300 
ccgcttactc cagaggatat cgttgttacc cacgattctg ttcgtccatt tattacgctt 360 
cgcatgattc aagacagtat caaacttgct caaaatcatg acgcagtgga tacagtagta 420 
gaagcagtgg atactatcgt tgaaagtacc aatggtcaat tcattacagg tattccaaat 480 
cgtgctcacc tctatcaggg acaaacacct caaacattcc gttgcaagga cttcatggac 540 
ctttatggat ctctttctga tgaagagaag gaaatcttga cagatgcatg taaaatcttt 600 
gtgatcaaag gaaaagatgt agccttggcc aaaggcgaat actcaaatct gaagattaca 660 
accgtaacag atttgaagat tgcaaaaagt atgattgaga aagactag 708 

<210> 2 
<211> 558 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 2 

atggctaacg taattattga aaaagctaaa gagagaatga cccagtctca ccaatcactt 60 
gctcgtgaat ttggtggtat ccgtgctggt cgtgccaatg caagcttgct tgaccgtgta 120 
catgtagaat actatggagt cgaaactcct cttaaccaaa tcgcttcaat tacgattcca 180 
gaagcgcgtg ttttgttggt aacaccattt gacaagtctt cattgaaaga catcgaacgt 240 
gccttgaacg cttctgatct tggtatcaca ccggctaatg acggttctgt gattcgcttg 300 
gttatcccag ctcttacaga agaaactcgt cgtgaccttg ctaaagaagt gaagaaggtc 360 
ggcgaaaatg ctaaagtggc tgtccgcaat atccgtcgcg atgctatgga cgaagctaag 420 
aaacaagaaa aagcacaaga aatcactgaa gacgaattga agactcttga aaaagatatt 4 80 
caaaaagtaa cagacgatgc tgttaaacac atcgacgaca tgactgctaa caaagagaaa 540 
gaacttttgg aagtctaa 550 
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<210> 3 
<211> 1353 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 3 

atgggtaaat attttgggac tgatggagtc cgtggagaag ctaacctaga actaacacca 60 
gaattagcct ttaaactagg acgttttgga ggctatgttc ttagtcaaca tgaaacggaa 120 
gcgccgaaag tctttgtagg acgtgacaca cgtatttcag gggaaatgct ggaatcggcc 180 
ttggtggcag gtctccttfc agtagggatt cacgtataca aacttggtgt ccttgcaaca 240 
ccagcagtag cttacttggt tgaaactgaa ggagcaagtg ccggtgtcat gatttctgct 300 
agccacaacc cagcccttga taacggaatc aagttctttg gcggtgatgg cttcaaacta 360 
gatgatgaaa aagaagcaga aattgaagcc ttgctagatg ctgaggaaga cactcttcct 420 
cgtccaagtg cagaaggctt aggaattttg gtagattatc cagaaggctt gcgtaagtat 480 
O gaaggatacc gtgtgtcaac tggaactcct cttgatggaa tgaaggttgc cttggataca 540 
gctaatggag cagcttctac cagtgcccgt caaatctttg cagaccttgg tgcccaattg 600 
acggttatcg gggaaacacc agacggtctt aacatcaacc ttaatgttgg ttcaacacat 660 
ccagaagccc ttcaagaagt ggtcaaagaa agtgggtcag ctattggttt ggcctttgat 720 
ggagacagtg accgcttgat tgctgttgat gagaatggtg acatcgtcga tggtgacaag 780 
attatgtaca tcatcggaaa atacctttct gaaaaaggac aattggctca aaatacaatt 840 
gtgacaactg ttatgtctaa ccttggtttc cacaaggcct tgaatcgcga aggtattaac 900 
aaggcagtta ctgcagttgg tgaccgctac gttgttgaag aaatgagaaa atcaggctac 960 
aaccttggtg gtgaacagtc tggtcacgtt atcttgatgg attacaatac cacaggtgat 1020 
ggtcaattat cagcagttca attgactaaa atcatgaagg aaactggtaa gagcttatca 1080 
gagttggcgg cagaagtaac gatttatcca caaaaattag ttaatatccg agtggaaaac 1140 
gtcatgaagg aaaaggccat ggaagtgcca gctatcaagg ccatcatcga gaagatggaa 1200 
gaagaaatgg cggggaacgg ccgtatcctt gttcgtccaa gtggaacaga gcccctcttg 1260 
cgtgttatgg cagaagcgcc tacaacagaa gaagtggact actatgttga taccatcaca 1320 
gatgtagttc gtgctgaaat tgggattgac taa 1353 



20 



25 



30 



35 



40 



<210> 4 
<211> 705 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 4 

atgaaaaaaa tactaattgt agatgatgag aaaccaatct cggatattat caagtttaat 60 
atgaccaagg aaggttatga agttgtaact gcttttaatg gtcgtgaagc gctagagcaa 120 
tttgaagcag agcaaccaga tattattatt ctggatttga tgcttccaga aattgatggt 180 

ttagaagttg ctaagaccat tcgtaagaca agcagtgtgc ccattcttat gctttcagcc 240 

aaagatagtg aatttgataa ggttatcggt ttggaacttg gggcagatga ctatgtaacg 300 

aaacccttct ccaatcgtga gttgcaggcg cgtgttaaag ctcttctgcg tcgttctcaa 360 

cctatgccag tagatggtca ggaagcagat agtaaacctc aacctatcca aattggggat 420 

ttagaaattg ttccagacgc ctacgtggct aaaaaatatg gcgaagaact agacttaacc 480 

4!> catcgtgaat ttgagctttt gtatcattta gcatcgcata caggtcaagt catcacgcgc 540 

gaacacttgc ttgagactgt ctggggttat gactattttg gtgatgtccg cacagttgat 600 

gtgactgtac gacgtctgcg tgagaagatt gaagatacgc ccagccgacc agagtatatc 660 
ttgacgcgcc gtggtgtagg gtattacatg agaaataatg cttga 705 

50 <210> 5 

<211> 1107 
<212> DNA 

<213> Streptococcus pneumoniae 
55 <400> 5 

atggaagaaa ttctctgtat tggttgtgga gcaaccattc agacgacaga taaagctggt 60 

cttggtttta ccccccagtc ggcacttgaa aaaggtttgg agactggcga agtctattgc 120 
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caacgctgtt tccgtctccg ccactacaat gaaatcacag atgtccagtt gacgaacgat 180 
gatttcctca agctcttgca cgaggtggga gacagtgatg ctttagtggt caatgtcatt 240 
gatatctttg attttaatgg atctgtcatc ccaggtttac cacgtttcgt ctcgggcaat 300 
gatgtcctct tggtaggaaa taaaaaagat atccttccta agtcagttaa gtctggtaag 360 
attagccagt ggctcatgaa acgtgcccat gaagaaggtc ttcgtccagt cgatgtggtc 420 
ctaacttcag cacaaaataa acatgccatt aaggaagtca ttgacaagat tgaacactac 4 80 
cgtaagggcc gcgatgtcta tgtggtcggt gtgaccaacg ttggaaaatc aactctaatc 540 
aatgctatta tccaagaaat cacgggtgat cagaatgtca tcactacttc acgcttccca 600 
gggacaacct tggacaaaat agagattccg cttgacgacg gatcttatat ttacgatacg 660 
ccgggaatta tccaccgcca ccagatggct cactacttga cggccaaaaa cctcaagtat 720 
gtcagtccta aaaaggaaat caagcctaag acctatcagc ttaatcctga gcaaacccta 780 
tttttaggtg gtttgggacg ctttgacttt atagcaggag aaaagcaagg atttactgct 840 
ttctttgata atgaactcaa actccatcgt agcaagcttg aaggagctag tgctttctac 900 
gataagcacc tgggaactct tctgacacca ccaaatagca aggaaaaaga agatttccca 960 
aggctagtcc agcatgtctt taccattaaa gataagacag acctagtcat ctcaggccta 1020 
ggatggattc gtgtaacagg cacagcaaaa gtcgccgtct gggcaccaga aggcgtcgcc 1080 
gtcgtcacac gaaaagcaat tatttaa 1107 

<210> 6 
20 <211> 1461 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 6 

25 atgtatccag atgatagttt gacattgcac acggacttgt accagatcaa catgatgcag 60 
gtttactttg accaagggat tcacaataag aaggcggtct ttgaggtgta tttccgccaa 120 
cagcctttta agaacggcta tgcggttttt gcaggtttgg aaagaattgt gaactatctt 180 
gaagacttgc gtttttcaga tagtgatata gcctatttgg agtcgcttgg ttatcatggg 240 
gcgttcttgg attaccttcg caatttcaag ttggagttga ccgttcgttc tgcccaagaa 300 
ggggatttgg tttttgctaa tgaaccgatt gtgcaggtgg aaggacctct agcccaatgt 360 
cagttggtcg aaacggctct tttgaacatc gtcaactacc agaccttggt ggcgacgaag 420 
gcagctcgta ttcgttcggt tatcgaagat gaacccttga tggagtttgg gacacgtcgg .4 80 
gctcaagaaa cggatgcggc catctgggga acacgcgcag cggtgattgg tggcgccaat 540 
ggaaccagca acgtgcgtgc gggtaagctc tttgacattc ctgttttggg aacccatgcc 600 
35 catgccttgg tacaggttta tggcaatgac tatgaggctt tcaaggctta cgctgcgacc 660 
cacaaaaatt gtgtctttct tgtggatacc tatgataccc ttcgcatcgg tgtaccagct 720 
gccattcagg tggcgcgtga gctgggtgat cagattaact ttatgggtgt gcggattgac 780 
tctggggata ttgcctacat ttctaagaaa gtccgtcagc aactggacga ggctggattt 8 40 " 
acagaggcta agatttatgc ttctaatgat ttggacgaaa atactatcct caatctcaag 900 
atgcaaaagg ccaagattga tgtctggggt gtgggtacca agctgattac agcctatgac 960 
cagccggctc ttggggcggt ttacaagatt gttgcaatcg aagatgaaac tggtcagatg 1020 
cgcaatacga ttaagctgtc taataatgcg gaaaaagtgt cgacgccagg taagaagcag 1080 
gtgtggcgca ttaccagtcg tgaaaaaggt aagtcagaag gtgattacat cacttatgat 1140 
ggtgtggata ttagcgacat gacagaaatc aagatgttcc atccgaccta tacatacatc 1200 
aagaagacgg ttcgtaattt tgatgccgtt cctctcttgg tggatatctt caaagaagga 1260 
atattagttt acaacttgcc tagtttgact gacattcagg attatgcccg taaagaattt 1320 
gacaagttgt gggatgagta taagcgtgtg ctcaatccgc agcactatcc agtggatttg 1380 
gcgcgtgatg tatggcaaga taagatggac ttgattgata agatgcgcaa ggaagccctt 1440 
ggtgaaggag aagaagaatg a 1461 



30 



40 



45 



50 



55 



<210> 7 
<211> 852 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 7 

atggctacta ttcaatggtt tcctggtcac atgtctaaag ctcgtcgaca ggtgcaggag 60 
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aatttaaaat ttgttgattt tgtgacgatt ttagtagatg cacgcttgcc tctatctagt 120 
caaaatccta tgttgaccaa gattgttggt gataaaccaa aactcttgat tttaaacaag 180- 
gccgacttgg ctgatccagc aatgaccaag gaatggcgtc agtattttga atcacaagga 240 
atccagacgc tagctatcaa ctccaaagag caagtgactg taaaagttgt aacagatgcg 300 
gccaagaagc tcatggctga taagattgct cgccagaaag aacgtgggat tcagattgaa 360 
accttgcgta ctatgattat cgggattcca aacgctggta aatcaactct gatgaaccgt 420 
ttggctggta aaaagattgc tgttgttgga aacaagccag gggtcacaaa aggtcaacaa 480 
tggcttaaaa ccaataaaga cctggaaatc ttggatacac cggggattct ctggcctaag 540 
tttgaggatg aaactgttgc acttaagttg gcattgactg gagctatcaa ggatcagttg 600 
cttcctatgg atgaggttac catttttggt atcaattatt tcaaagaaca ttatccagaa 660 
aagctggctg aacgcttcaa acaaatgaaa attgaagaag aaccgtctgt gattattatg 720 
gatatgaccc gcgccctcgg tttccgtgat gactatgacc gtttttacag tctcttcgtg 780 
aaggaagttc gtgatggcaa actcggtaac tataccttag atacattgga agacctcqat 840 
ggcaacgatt aa y 852 

<210> 8 
<211> 471 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 8 

atgattaaca atgttgtact tgtagggcgt atgacacgtg acgctgagtt gcgttatacc 60 
ccatcaaatg tagcagttgc gacttttact cttgcagtaa accgtacatt taagagtcaa 120 
aatggtgaac gtgaggctga ttttatcaat gtcgttatgt ggcgccaaca ggctgaaaat 180 
O cttgctaact gggctaaaaa aggctcactt atcggggtga caggtcgtat ccagactcgt • 240 
agttacgata accagcaagg acaacgtgtc tacgtgacag aggtcgtggc tgagaatttc 300 
caaatgttgg aaagccgtag tgtgcgtgag ggccacacag gtggagctta ctctgcacca 360 
actgcaaact attcagcacc tacaaattca gtaccagact tttcacgtaa tgaaaatcca 420 
tttggagcaa caaacccatt ggatatttca gatgatgatt taccattcta a 471 



10 



15 



20 



30 



35 



<210> 9 
<211> 975 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 9 

atgaaaacgc gtattacaga attattgaag attgattatc ctattttcca aggagggatg 



60 



gcctgggttg ctgatggtga tttggcaggg gctgtttcca aggctggagg attaggaatt 120 
atcggtgggg gaaatgcccc gaaagaagtt gtcaaggcca atattgataa aatcaaatca 180 
W ttgactgata aaccctttgg ggtcaacatc atgctcttat ctccctttgt ggaagatatc 24 0 
gtggatctcg ttattgaaga aggtgttaaa gttgtcacaa caggagcagg aaatccaagc 300 
aagtatatgg aacgtttcca tgaagctggg ataatcgtta ttcctgttgt tcctagtgtc 360 
gctttagcta aacgcatgga aaaaatcggt gcagacgctg ttattgcaga aggaatggaa 420 
gctggggggc atatcggtaa attaacaacc atgaccttgg tgcgacaggt agccacagct 480 
« atatctattc ctgttattgc tgcaggagga attgcggatg gtgaaggtgc tgcggctggc 540 
tttatgctag gtgcagaggc tgtacaggtg gggacacggt ttgtagttgc aaaagagtcg 600 
aatgcccatc caaactacaa ggagaaaatt ttaaaagcaa gggatattga cactacgatt 660 
tcagctcagc actttggtca tgctgttcgt gctattaaaa atcagttgac tagagatttt 720 
gaactggctg aaaaagatgc ctttaagcag gaagatcctg atttagaaat ctttgaacaa 780 
OU atgggagcag gtgccctagc caaagcagtt gttcacggtg atgtggaggg tggctctgtc 840 
atggcaggtc aaatcgcagg gcttgtttct aaagaagaaa cagctgaaga" aatcctaaaa 900 
gatttgtatt acggagccgc taagaaaatt caagaagaag cctctcgctg gacaggagtt 960 
gtaagaaatg actaa yv y g ^ 

55 <210> 10 
<211> 423 
<212> DNA 
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<213> Streptococcus pneumoniae 



<400> 10 

atgatcgata 

gaccgtgtct 

gagcctttct 

gaagccttgg 

ctggtctttt 

caattggtta 

aaggctgaag 

taa 



ttcaaggaat 
tggaggtgag 
ttaacggcca 
cgcaaactgc 
acgctggtat 
tgacagcgac 
tggatggcaa 



caaagaagcc 
cgaggatacc 
ctttcctcaa 
cggtgtgttg 
ggataaggtt 
ttttgtaaaa. 
gcttgcagcc 



cttccccacc 
attgttgcta 
tacccagtta 
gagttatcaa 
aagttcaaga 
cgtcgtggca 
agtggtaccc 



gttatcctat 
tcaaaaatgt 
tgccaggtgt 
aacctgaaaa 
agcaagttgt 
ccatagctgt 
ttacttttgc 



gcttctagtg 60 
gaccatcaac 120 
tctgattatg 180 
taaaggaaaa 240 
accaggcgac 300 
ggttgaagca 360 
aattgggaac 420 
423 



<210> 11 . 
<211> 1023 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 11 

atgattaatc 

gaggctattg 

gcggatcagc 

atggcaatga 

gttggtcaaa 

tatgaaaact 

gagtttgttt 

gcagccatta 

gctcatagca 

gccaatatta 

gaaaagttgg 

gagttggcct 

aatgacttga 

tataaagtca 

tctcgttctg 

gccaatcgtc 

caccgtgtct 

taa 



aaatttatca 

accaagagaa 

gttactacca 

ttcacgagtc 

aagttgtcat 

acatgacagg 

ctctccctaa 

cagagtttgt 

agcgggagcg 

tcaactatac 

aactcttctc 

ttgaccatgc 

ttcgctacat 

atctcaatac 

gtcgcattga 

ttaaaaatat 

ttgcaaccga 



actaactaaa 

tcatatcctt 

gggaaaacgt 

atgtggaatc 

gattcccaat 

gapccatttc 

agatcgtgtg 

cagtgtgggc 

gatccccgtt 

tttgccagaa 

atttgccaaa 

ttttgaatgt 

tcgtcctcag 

tcgcgatgcc 

ttttgaaaat 

cctttatcta 

tttaaacaca 



cctaagttta 

atccgtccca 

gatcccaaga 

gtcatttctg 

cagtctccta 

ttgtctagtg 

gtggcttatg 

atgcacgcta 

attggagatg 

gcagagattg 

gaatgctata 

tgtggtggtg 

ggaacaattc 

ttagaaaagg - 

gctatccaaa 

gaagaacctg 

gcctttaaaa 



tcaatgtcaa 

actacatggc 

ttttgaataa 

acccgagcgg 

tgcagagtga 

gatttgatgg 

atgctattga 

tgaatcgtct 

gaagtttagc 

tggttattgg 

ttacggataa 

atggtactgg 

tcatgatggg 

gcttgctctt 

tgatgaaagt 

taagagaaat 

cagtgtttaa 



atatcaggaa 

tgtctgtcat 

aaagettcca 

aacctacgag 

tgaagaattc 

ctttatgaga 

agatacggtt 

attgactctt 

ttttgtggtt 

tcgtcattgg 

tattcctgaa 

accagctatt 

agttagcgaa 

ggttgggtca 

caagaaattt 

taaagatatt 

gtgggaagta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 . 

1020 

1023 



<210> 12 
<211> 1344 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 12 

atgaacttaa 

cgtcttggac 

ttacaaaacc 

ctgacaactg 

ccaagcggtg 

aaaactggga 

gactatatcc 

ttcggtgaaa 

gctactgttc 

atagagtatt 

gaagggattc 

aacttgggtg 

ttgacaaaac 

tacggtatcc 



aaactacttt 

gtggaagtac 

tagctaagaa 

ccctcactgt 

ccaacatgat 

aaaatattgc 

agcctagtct 

tctatactac 

tccttaacgg 

ttggttttga 

tctgtcctga 

cctatatctg 

tggttgagtt 

aaatcggcgg 



gggccttctt 

gctcccaggg 

ctacgagatt 

cggcatttta 

tacagggatt 

cgtcctcgaa 

ttttgtcatt 

ctataacatg 

agacagtcca 

cttggaaaaa 

ctgccaaggc 

tgagggttgt 

gaccaacaat 

gctctataat 



gctggacgtt 
aaagtcgccc 
gtcgttgtca 
aaagaggtct 
gcaacaacct 
attgacgaag 
actaatatct 
atattggatg 
cttttctaca 
ggaccagccc 
atcctcaaat 
ggatgtaaac 
cgctctcgct 
atctataacg 



cttcccactt 
ttcaatttga 
ctggaacaaa 
atggtcaagt 
tcctaacagc 
ccagtctatc 
tccgtgacca 
ccattcggaa 
agccaactat 
aactggctca 
atgagcataa 
gtcctgatct 
ttgtcataga 
ccctagctgc 



cgttttaagc 60 
taaagatatt 120 
tggaaaaacc 180 
tctaaccaat 240 
caaatcttct 300 
tcgtatctgt 360 
gatggaccgt 420 
agttccaact 480 
tccaaaccct 540 
ctacaatacc 600 
tacctatgca 660 
cgactatcgt 720 
cggccaagaa 780 
tgtggccatc 840 
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gcccgtttcc tcggcgcaga ttcccaactc atcaaacagg gatttgacaa gagccgtgct 900 
gtctttggac gccaagaaac ctttcatatc ggtgacaagg aatgtaccct tgtcttgatt 960 
aaaaatccag tcggtgcaac ccaagctatc gaaatgatca aactagcacc ttatccattt 1020 
agcctatctg tcctccttaa tgccaactat gcagatggaa ttgacactag ctggatctgg 1080 
gatgcagact ttgagcaaat cactgacatg gacattcctg aaatcaacgc tggcggtgtt 114 0 
cgtcattctg aaatcgctcg tcgcctccga gtgactggct atccagctga gaaaatcact 1200 
gaaacgagta atctggagca agttctcaag accattgaga atcaagactg caagcatgcc 12 60 
tatattctgg caacttatac tgccatgctg gaatttcgtg aactgctggc tagtcgtcag 1320 
attgttagaa aggagatgaa ctaa 1344 

<210> 13 
<211> 783 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 13 

atggtttata cttcactttc ctcaaaagat ggcaattacc cctatcagct caacattgcc 60 
cacctctacg gaaatctcat gaatacctac ggggacaatg gaaacatcct catgctcaag 120 
tatgtggctg aaaaactggg agcccatgtg accgttgaca tcgtttctct ccatgatgac 180 
tttgatgaaa atcactacga catcgccttt ttcggtggtg gtcaagactt tgaacaaagt 240 
atcattgcag acgacctacc tgctaaaaaa gagagcattg acaactacat ccaaaacgac 300 
ggtgtagttc tggctatctg cggtggtttc caactattgg gtcaatatta tgttgaagct 360 
tcaggaaaac gtatcgaagg gctaggggtc atgggacact acacgctcaa ccagaccaat 420 
aaccgtttta tcggtgacat caagattcac aatgaagatt tcgatgaaac ctactatgga 4 80 
15 tttgaaaatc accaaggtcg taccttcctc tctgatgacc aaaaaccgct gggacaggtt 540 
gtctatggaa atggaaacaa cgaagaaaag gtcggtgaag gggttcatta taagaatgtc 600 
tttggttcct acttccacgg gcctatcctc tctcgtaatg ccaatctggc ttatcgccta 660 
gttactactg ccctcaagaa gaaatatggt caggacatcc aactccctgc ctatgaggac 720 
attctcagcc aagaaatcgc tgaagagtac agtgacgtca aaagcaaggc tgacttttct 780 
30 taa ?83 

<210> 14 
<211> 276 
<212> DNA 
35 <213> Streptococcus pneumoniae 



<400> 14 

atggcaaaca aacaagattt gatcgctaaa gtagcagaag ctacagaatt aactaagaaa 60 
gactcagcagcagcagttga agctgtattt gcagcagtag ctgactatct tgcagctggt 120 
gaaaaagttc agttgatcgg ttttagtaac tttgaagttc gtgagcgcgc agaacgtaaa 180 
ggtcgcaacc cacaaactgg taaagaaatg acaattgcag cttctaaagt accagcattc 240 
aaagctggta aagctcttaa agacgctgtt aaataa 276 



<210> 15 
45 <211> 840 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 15 

atgggaattg ctctagaaaa tgtgaatttt acatatcaag aaggtactcc cttagcttca 60 
gcagctttgt cggatgtttc tttgacgatt gaagatggct cttatacagc tttaattggg 120 
cacacaggta gtggtaaatc aactatttta ca.actcttaa atggtttatt ggtgccaagt 180 
caagggagtg tgagggtttt tgatacctta atcacctcga cttctaaaaa taaagatatt 240 
cgtcaaatta gaaaacaggt tggcttggta tttcagtttg ctgaaaatca gatttttgaa 300 
gaaacggttt tgaaggacgt tgcttttgga ccgcaaaatt ttggagtttc tgaagaagat 360 
gctgtgaaga ctgcgcgtga gaaactggct ctggttggaa ttgatgaatc actttttgat 420 
cgtagtccgt ttgagctgtc agggggacaa atgagacgtg ttgccattgc aggcatactt 480 



6 
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gccatggagc cagctatatt agtcttagat 
agaaaagagt tgatgaccct gttcaaaaaa 
gtaacgcatt tgatggatga tgttgctgaa 
ggacgtttag taaagggggg caaaccaagt 
5 gaagttcagt tgggagtacc taaaattacg 
gtgtcattta aacgattacc ggttaagata 

<210> 16 
<211> 930 
10 <212> DNA 

<213> Streptococcus pneumoniae 



gagccaacag ctggtctaga tcctctaggg 540 
ctccaccagt cagggatgac catcgtcttg 600 
tatgcgaatc aagtctatgt aatggaaaag 660 
gatgtctttc aagacgttgt ttttatggaa 720 
gccttttgta aacgattggc tgatagaggc 780 
gaggagttca aggagtcgct aaatggatag 840 



<400> 16 

atggatattc aatttttagg aacgggggct ggtcagccct ctaaagcccg caacgtttca 60 
lt> agtctcgccc tgaaactttt ggatgagatt aacgaagttt ggctctttga ctgtggagaa 120 
ggtacgcaaa atcgcattct ggaaaccaca attcgaccac gtaaggtcag caaaatcttt 180 
attacccatc tgcatggaga ccacattttt ggtttgccag gtttcctttc tagccgtgcc 240 
tttcaggcca atgaagagca gacagatttg gaaatctacg gacctcaagg aatcaagtca 300 
tttgtcttaa ccagccttcg tgtgtcaggt tctcgtctgc cctaccgcat tcatttccat 360 
gagtttgacc aagattctct aggtaaaatt cttgaaatcg ataaattcac tgtgtatgca 4 20 
gaggagctgg accacactat tttctgtgtt ggctatcgtg tcatgcaaaa ggatctagaa 4 80 
gggacgctgg atgctgaaaa actcaaggct gctggtgttc cgttcggccc gctttttggt 54*0 
aaaatcaaaa atggccagga tcttgttttg gaagacggaa ctgaaatcaa ggcagcagac 600 
tatatctcag cgccacgtcc aggtaagatt atcactattt taggagacac tcgaaaaacg 660 
O gatgccagtg tgcgtctggc tgtcaatgca gatgtcctag ttcatgagtc cacttatggc 720 
aagggtgatg aaaaaattgc tcgtaaccat ggtcactcaa ctaatatgca agctgcacaa 780 
gtagcggtag aagcaggtgc caaacgcctc ctactcaacc atatcagtgc ccgtttcctc 84 0 
tcaaaagata ttagcaaact caagaaagac gctgccacaa tttttgaaaa tgtccatgtg 900 
gtcaaagact tggaagaagt ggaaatctag g30 

<210> 17 
<211> 1662 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 17 

atgagtaata tcagtttaac aacacttggt ggtgtgcgtg agaatggaaa aaatatgtac 60 
attgctgaaa ttggagagtc catttttgtt ttgaatgtag ggttaaaata tcctgaaaat 120 
gaacaattag gggtcgatgt ggtgattcca aacatggatt acctttttga aaatagcgac 180 
cgtattgctg gggttttctt gacccacggg catgcggatg cgattggtgc tctaccttat 24 0 
ctcttggcag aggctaaagt tcctgtattt gggtctgagt tgaccattga gttggcaaag 300 
ctctttgtca aaggaaatga tgccgttaag aaatttaatg atttccatgt cattgatgag 360 
aatacggaga ttgattttgg tgggacagtg gtttccttct tccctacgac ttactccgtt 4 20 
ccagagagtc tgggaattgt cttgaagaca tcggaaggaa gcatcgttta tacaggtgac 4 80 
ttcaaatttg accaaacggc tagtgaatct tatgcaactg attttgctcg tttggcagag 54 0 
attggtcgtg acggcgtcct ggctctcctc agtgattcgg ccaatgcaga cagcaatatt 600 
caggtggcta gtgaaagtga agttagggat gaaattaccc aaactattgc tgactgggaa 660 
ggtcgtatca tcgttgcagc tgtttccagt aatctttctc gtattcagca gatttttgac 720 
gctgcggata aaacaggtcg acgtatcgtc ttgacaggat ttgatattga aaatatcgtc 780 
cgcacagcga ttcgtcttaa gaagttgtct ttagccaacg aaattctctt gattaagcct 8 40 
aaagatatgt ctcgctttga agaccatgag ttgattattc ttgagacagg tcgtatgggt 900 
gaacctatca atggacttcg taagatgtcg attggtcgcc atcgttatgt agaaatcaag 960 
gatggggacc tggtctatat tgctacggct ccgtctattg ctaaagaagc ctttgttgcg 1020 
cgtgtagaaa atatgattta tcaggcaggt ggggttgtca aattgattac ccaaagttta 1080 
catgtatcag ggcacggaaa tgtgcgtgat ttgcagctga tgatcaatct tttgcaacct li40 
aagtacctct tccctgtcca aggggagtat cgtgagttgg atgctcacgc taaggctgcc 1200 
atggcagttg ggatgttgcc agaacgcatc ttcattccta aaaaggggac gaccatggct 1260 
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tacgagaatg gagactttgt tccagctgga tcggtttcag caggagatat cttaattgat 1320 

gggaatgcca ttggtgatgt tggaaatgtt gttcttcgtg accgtaaggt cttgtcagag 1380 

gatggaattt tcatcgtggc tattacagtc aaccgtcgtg agaagaaaat tgtggctaga 1440 

gctcgtgttc acacgcgtgg atttgtttat ctcaagaaga gtcgcgatat tctccgtgaa 1500 

3 agttcagaat tgattaacca aacggtagaa gattatcttc aaggagatga ctttgactgg 1560 

gcagatctta aagggaaggt tcgagataat ttgaccaagt atctctttga ccaaaccaag 1620 

cgtcgtccag ctattttacc agtagtcatg gaagcaaaat aa . 1662 

<210> 18 
10 <211> 951 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 18 

atgacaaaag aatttcatca tgtaacggtc ttactccacg aaacgattga tatgcttgac 60 
gtaaaacctg acggtatcta cgttgatgcg actttgggtg gagcaggcca tagcgagtat 120 
ttattaagta aattaagtga aaaaggccat ctctatgcct ttgaccagga tcagaatgcc 180 
attgacaatg cgcaaaaacg cttggcacct tacattgaga agggagtggt gacctttatc 240 
aaggataact tccgtcattt acaggcacgt ttgcgcgaag ctggtgttca ggaaattgat 300 
ggaatttgtt atgacttggg agtgtctagt cctcaattgg accagcgtga gcgtggtttt 360 
tcttataaaa aggatgcgcc actggacatg cggatgaatc aggatgctag tctgacagcc 420 
tatgaagtgg ttaatcatta tgactatcat gatttggttc gtattttctt caaatacggt 480 
gaggataaat tctctaaaca gattgcgcgt aagattgagc aagcgcgtga agtgaagccg 540 
attgagacaa cgactgagtt agcagagatt atcaagttgg tcaaacctgc caaggaactc 600 
aagaagaagg gtcatcctgc taagcagatt ttccaggcta ttcgaattga agtcaatgat 660 
gaactggggg cggcagatga gtccatccag caggctatgg atatgttggc tctggatggt 720 
agaatttcag tgattacctt tcattcctta gaagaccgct tgaccaagca attgttcaag 780 
gaagcttcaa cagttgaagt tccaaaaggc ttgcctttca tcccagatga tctcaagccc 84 0 
aagatggaat tggtgtcccg taagccaatc ttgccaagtg cggaagagtt agaagccaat 300 
aaccgctcgc actcagccaa gttgcgcgtg gtcagaaaaa ttcacaagta a 951 

<210> 19 • 
<211> 999 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 19 

atgagtagaa ttttagataa tgagataatg ggggatgagg agttagtaga acgcacgctc 60 
cgtcctcagt atttacgtga atatatcgga caggataagg tcaaggacca gctacaaatc 120 
tttattgaag ctgccaaaat gcgggatgaa gcgctggatc atgtgctctt atttgggcct 180 
ccaggtctcg ggaaaacgac catggccttt gttattgcca acgaactggg agtcaatctt 240 
aagcagacgt cgggtccagt cattgaaaaa gccggagatc tggtagctat tttgaatgag 300 
ttagagcctg gggatgtact ttttattgat gagatccatc gtttgccaat gtcagtggaa 360 
gaggtgcttt atagtgctat ggaggacttc tacatcgata ttatgattgg ggctggtgag 420 
ggtagtcgta gtgttcattt gg.agttacca ccttttacct tgattggtgc gacgactcgg 480 
gctggtatgc tctccaatcc gctacgggca cgttttggga ttacaggcca tatggagtat 540 
tatgcccatg ctgacttgac agaaattgtc gagcggacgg cagatatttt tgagatggaa 600 
atcactcatg aggcagcatc tgagttggcc ctacgtagtc gtgggacccc tcgtattgcc 660 
aatcgtctcc tcaagcgcgt gcgcgatttt gcccagataa tggggaatgg ggtaattgat 720 
gatattatta ccgataaggc tttgactatg ctggatgttg accatgaagg tttggactat 780 
gtggatcaaa aaatccttcg taccatgatt gagatgtaca gtggaggacc tgttggtcta 840 
ggaactcttt ctgtgaatat cgccgaagag cgtgagacag ttgaagacat gtatgagcct 900 
tacttgattc aaaaaggttt tatcatgcgg acacggtctg gacgggtggc gactgctaag 960 
gcatatgagc acttaggtta tgaatacagt gaaaaataa 9 999 

<210> 20 
<211> 1311 
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<212> DNA 

<213> Streptococcus pneumoniae 
<4 00> 20 

atgagtatgt ttttagatac agctaagatt aaggtcaagg ctggtaatgg tggcgatggt 60 
atggttgcct ttcgtcgtga aaaatatgtc cctaatggag gcccttgggg tggtgatggt 120 
ggtcgtggag gcaatgtggt cttcgttgta gacgaaggac tacgtacctt gatggatttc 180 
cgctacaatc gccatttcaa ggctgattct ggtgaaaaag ggatgaccaa agggatgcat 240 
ggtcgtggtg ctgaggacct tagagttcga gtatcacaag gtacgactgt tcgtgatgcg 300 
gagactggca aggttttaac agatttgatt aaacatgggc aagaatttat cgttgcccac 360 
ggtggtcgtg gtggacgtgg aaatattcgt tttgcgacac caaaaaatcc tgcaccggaa 420 
atctctgaaa atggagaacc aggtcaggaa cgtgagttac aattggaact aaaaatcttg 480 
gcagatgtcg gtttagtagg attcccatct gtagggaagt caacactttt aagtgttatt 540 
acctcagcta agcctaaaat tggtgcctac cactttacca ctattgtacc aaatttaggt 600 
15 atggttcgca cccaatcagg tgaatccttt gcagtagccg acttgccagg tttgattgaa 660 
ggggctagtc aaggtgttgg tttgggaact cagttcctcc gtcacatcga gcgtacacgt 720 
gttatccttc acatcattga tatgtcagct agcgaaggcc gtgatccata tgaggattac 780 
ctagctatca ataaagagct ggagtcttac aatcttcgcc tcatggagcg tccacagatt 840 
attgtagcta ataagatgga catgcctgag agtcaggaaa atcttgaaga atttaagaaa 900 
aaattggctg aaaattatga tgaatttgaa gagttaccag ctatcttccc aatttctgga 960 
ttgaccaagc aaggtctggc aacactttta gatgctacag ctgaattgtt agacaagaca 1020 
ccagaatttt tgctctacga cgagtccgat atggaagaag aagcttacta tggatttgac 1080 
gaagaagaaa aagcctttga aattagtcgt gatgacgatg cgacatgggt actttctggt 1140 
gaaaaactca tgaaactctt taatatgacc aactttgatc gtgatgaatc tgtcatgaaa 1200 
tttgcccgtc agcttcgtgg tatgggggtt gatgaagccc ttcgtgcgcg tggagctaaa 1260 
gatggggatt tggtccgcat tggtaaattt gagtttgaat ttgtagacta g 1311 



20 



25 



<210> 21 
<211> 519 
30 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 21 

atgaactact ttaatgttgg gaaaatcgtt aatacgcagg gattacaggg tgagatgcga 60 
35 • gtcttgtctg tgacggattt tgcagaagaa cggtttaaaa aaggagctga gctggctttg 120 
tttgatgaaa aagatcagtt tgtccaaaca gtgaccatcg ctagccaccg taaacagaag 180 
aactttgaca ttattaaatt caaagatatg taccatatca atactatcga aaagtacaag 240 
ggatacagtc tcaaggtcgc tgaggaagat ttgaatgacc tagacgatgg tgaattttac 300 
tatcacgaga ttatcggttt ggaagtctat gagggtgata gcttggttgg aaccatcaag 360 
gaaatcctgc aaccaggtgc taatgatgtc tgggtggtca aacgaaaagg caaacgtgat 420 
ttgcttttac cttatatccc accagtggtt ctcaatgttg atattccaaa taaacgggtc 4 80 
gatgtggaaa tcttagaagg gttagacgat gaagattga * 519 



40 



<210> 22 
45 <211> 720 
<212> DNA 

<213> Streptococcus pneumoniae 



50 



<400> 22 

atgaagattg atattttaac cctctttcca gagatgtttt ctccactgga gcactcaatc 60 
gttggaaagg ctcgagaaaa agggctcttg gatatccagt atcataattt tcgagaaaat 120. 
gctgaaaagg cccgtcatgt agatgatgag ccctacggag gcggtcaggg catgttgctc 180 
agagcacaac ctattttcga ttcctttgat gctattgaaa agaaaaatcc gcgcgttatt 240 
ctcctcgatc ctgctggaaa gcagtttgat caggcttatg ctgaagattt ggctcaagag 300 . 
55 gaagagctaa tctttatctg tgggcactat gagggttatg atgagcgcat taagaccttg 360 
gtaacagatg agatttccct aggcgactat gtcctcactg gtggagaatt ggcagctatg 420 
accatgattg atgctacagt tcgcctgatt ccagaagtga ttggcaagga gtctagccac 480 
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caagatgata gtttttcttc aggtctttta gaatatcatc agtacacacg tccctataat 540 
tatcgaggca tggtcgtgcc agatgtattg atgagtggcc atcatgaaaa gattcgSq 600 
tggcgattgt acgagagttt aaagaaaacc tacgagcgca gaccggattt acttqaaca? 660 
tatcaactga cagtagaaga agaaaaaatg ctggcagaaa Laaa'gaaaa caaagaS 720 

<210> 23 
<211> 561 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 23 

atgattgaag caagtaaatt aaaagctggt atgacctttg aaacagctga cgocaaatta 60 

itttlZtl ccaggtaaag" gaaaca'cgat cat^gtatg 120 
aaattgcgtg atgtccgtac tggttctaca tttgacacaa gctaccgtcc agaggaaaaa 180 

15 ttTTS 9 f at t atC9a ^ctgtccca gctcaatact tgtacaaaat gg^gacac* 240 

gcatacttca tgaatacaga aacttacgac cagtacgaaa tccctgtagt caatStqaa 300 

Tdt* T ! ttaC \\ CCt tgaaaactct gatgtgaaaa tccaa?tc?a cggaKgaa 360 

gtgatcggtg tcaccgttcc tactactgtt gagttgacag ttgctgaaac tcaacca?ct 420 

20 a ! Caaag 9tg ctactgttac aggttctggt aaaccagcaa cgatggaaac tg«ct£S 480 

20 sssss: sss: r agcag9a caaaaactcg ttat — "o° 
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<210> 24 
<211> 1572 
25 <212> DNA 

<213> Streptococcus pneumoniae 
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<400> 24 

atggcatttg aaagtttaac agaacgtttg cagaacgtct ttaaaaatct acqtaaaaaa 60 
ggaaaaatct ctgaatctga tgtccaagag gcaaccaaag aaattcgct^ ggccttgc'c 120 
gaggcogacg ttgccttgcc tgttgtaaag gactttatca agaaagJtcg ?gagcg?gca 180 
■ gtcgggcatg aggtcattga tacacttaat cctgcgcaac agattattaa aatcStS 240 
gaggaactga cagccgtttt aggttctgat acggcagaaa ttatcaagtc accSawS 300 
cca.cc.tc. tcatgatggt tggtttacaa g.gggctggta aaacaacctt tgc^ggKaa 360 
ttggccaaca aactcaagaa agaagaaaat gctcgtcctt tgatgattgc ggcggatat? 420 
ctgccattga ccagcttaag accttgggac aacagattga JgtgLtgtc 4S0 
tttgcacttg gaacagaagt accagctgtt gagattgtac gtcaaggttt ggagcaaqcc 540 
ctcctcatL C T aCg tt ta ^ tCtt ^« gatactgcgg otcgtttgc. 22 
ctcctcatga atgagcttcg tgatgtgaaa .acattggctc aaccaaatga aatcttgctt 660 
gtcgttgatg ctatgattgg tcaggaagca gccaatgttg cgcgtgag?t taatg^caq 720 
tlT^J* Ct ^t Cat ^accaag attgatggcg atartJgtgg tggtgcS? 780 
ctgtctgttc gtcacattac tggaaaacca atcaagttca ctggtacagg t^aaLgaS 840 
acggacattg aaaccttcca cccagaccgc atgtctagcc gtatccttgg tatggcqqat 900 
«t2«2i .J* att *»* M agcttctcag gaatacgatg aacaaaaag? ccttgSXg 9 0 
gctgagaaga tgcgcgaaaa cacctttgat tttaatgatt tcatcgatca attagatcaa 1020 
gtgcaaaata tggggccgat ggaagacttg ctcaagatga ttccaggtat ggcaaac^a? iSsS 
SK£ aaa T tgaa ^ggatgaa cgccagattg ctcgtalacg tgceattgjg il40 
tcttcgatga cacctgaaga gcgtgaaaac ccagatttgt taaatccaag ccgtcgccgt 1200 
cgtattgctg ctggttctgg aaatacattc gtcgaagtca ataaattcat caaggact?t 1260 
« a "eagctcat gcagggtgtt atgtctgggg atatgaataa aatga'gaag 13S 
caaatgggga ttaatccaaa taaccttcct . aaaaatatgc caaatatggg aggLtggat 1380 
atgtctgccc ttgaaggaat gatgggacaa ggcggtatgc ctgactta?? agrtcSga 1440 
KEEK tlT^l" ^f CCa ^ "tggtggcg gtttgaaagg tZa.*3S Hit 
a^cgcaagt aa' ^ eat * MC * t atggctaaca aaatgaagaa agcgaagL'g 1560 

1572 

<210> 25 
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<211> 846 
<212> DNA 

<213> Streptococcus pneumoniae 

5 <400> 25 

atgtatctta ttgaaatttt aaaatctatc 
tggttgccga tttccagtac aggtcatttg 
aatcaaaatg aagcctttat gtccatgttt 
gcagttatgg tgatttattt taacaagctc 

10 gaagttcgta agacttggag actatggttg 
ggtgtcttta aatttgatga ttggtttgat 
ctcatgttga ttatctacgg ggttgccttc 
gctatcgagc caagtgtaac agagttggac 
ggactcttcc aagttcttgc tcttttacca 

15 ggtggtttgt taaatggaac cagtcgttca 
attcccgtta tgtttggagc tagtgcctta 
. ctcttgagct ttgggcaatt gtttttgctc 
agcatggtgg ctattcgctt cttgaccagc 
ggtaaatacc gtatcgtgct tggtagtgtt 

20 gtataa 

<210> 26 
<211> 1290 
<212> DNA 
25 <213> Streptococcus pneumoniae 



ttcttcggga ttgttgaagg aattacggaa 60 
attttagcag aggagtttat ccaataccaa 120 
aatgtcgtga ttcagcttgg tgctatttta 180 
aatcctttta aaccgactaa ggacaaacag 240 
aaggtcttga ttgctacttt acctttactt 300 
acccacttcc ataacatggt ttcagttgct 360 
atctatttgg aaaagcgcaa taaagcgcgt 420 
aagcttcctt atacgaccgc tttctatatc 480 
gggactagcc gttcaggtgc aacgattgtc 540 
gttgtgacag aatttacctt ctatcttggg 600 
aagattttca aatttgtgaa agccggagaa 660 
ttggtcgcga tgggagtagc ttttgcggtc 720 
tatgtgaaaa aacacgactt cacccttttt 780 
ttgctacttt acagttttgt ccgtttattt 840 

846 



<400> 26 

atgggattat ttgaccgtct attcggaaaa aaagaagaac ^ctaaaatcga agaagttgta 60 
aaagaagctc tggaaaatct tgatttgtct gaagatattg "agcctgcctt cacagaagct 120 
30 gaggaagttt ctcaagaaga agcagaggtt gaaagttctg aagaatctgt gttccaagaa 180 
gaggatagtc aagacacagt cgaagaaaat ctggatttag agccagttgt agaggtttct 240 
caagaagaag tagaagaatt tccaaactca caagaagtca cagaggaaga gaagcttgag 300 
cacgaaggaa ctgtagaaga aaataatttt gaagtgcttg aaccagaagc tcctcaaaca 360 
gaagaaactg ttcaggaaaa atatgaccgc agtcttaaga aaactcgcac aggtttcggt 420 
35 gcccgcttga atgccttctt tgctaacttc cgctctgttg acgaagaatt tttcgaggaa 4 80 
ctggaagaac tgttgattat gagtgatgtt ggtgtccaag tcgcttctaa cttaacggag 540 
gaactacgtt acgaagccaa gcttgaaaat gccaagaaac ctgatgcact tcgtcgtgtc 600 
atcattgaga aattggttga gctttatgaa aaggatggta gctacgatga aagcatccac 660 
ttccaagata acttgacagt tatgctcttt gttggtgtga atggtgttgg gaaaacaact 720 
40 tctatcggaa aactagccca ccgctacaaa cgagctggta agaaggtcat gctggttgca 780 
gcagatacct tccgtgcggg tgcagtagct cagctagctg aatggggccg acgagtagat 840 
gttccagtag taactggacc tgaaaaagct gatccagcca gcgtggtctt tgatggtatg 900 
gaacgtgccg tggctgaagg tatcgatatt ctcatgattg atactgctgg tcgtctgcaa 960 
aataaggata accttatggc tgagttggaa aagattggtc gtattatcaa acgtgttgtg 1020 
45 ccagaagcac cacatgaaac. cttcttggca cttgatgcat caacaggtca aaatgcccta 1080 
gtacaggcca aagaattttc gaaaatcaca cctttaacgg gaattgtttt gactaagatt 1140 
gatggaactg ctcgaggagg tgtggttcta gccattcgtg aagaactcaa tattcctgta 1200 
aaattgattg gttttggtga aaaaatcgat gatattggag agtttaactc agaaaacttc 1260 
atgaaaggtc tcttggaagg tttaatctaa 1290 



50 



55 



<210> 27 
<211> 498 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 27 

atgtatattg aaatggtaga tgaaactggt caagtttcaa aagaaatgtt gcaacaaacc 60 
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caagaaattt tggaatttgc agccaaaaaa ttaggaaaag aagacaagga gatggcagtc 120 
acttttgtga ccaatgagcg tagtcatgaa cttaatctgg agtaccgtga caccgaccgt 180 
ccgacagatg tcatcagcct tgagtataaa ccagaattgg aaattgcctt tgacgaagag 24 0 
gatttgcttg aaaatccaga attggcagag atgatgtctg agtttgatgc ctatattggg 300 
gaattgttca tctctatcga taaggctcat gagcaggccg aagaatatgg tcacagcttt 360 
gagcgtgaga tgggcttctt ggcagtacac ggctttttac atattaacgg ctatgatcac 420 
tatactccgg aagaagaagc ggagatgttc ggtttacaag aagaaatttt gacagcctat 480 
ggactcacaa gacaataa 498 

<210> 28 
<211> 768 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 28 

atgagtattc gagtaattat tgccggtttt aagggaaaga tgggccaggc tgcttgtcag 60 ' 
atggtattga ctgatccaga cttggacttg gtggcagttt tggatccttt tgagtctgag 120 
tcagaatggc agggtattcc tgttttcaag gataaggctg atttagctgg ttttgaagcg 180 
gatgtctggg tagattttac tactccagct gttgcctacg aaaatacacg ttttgctctt 240 
gaaaatggct ttgctccagt agttggaacg actggtttca cgagtgaaga aattgcagag 300 
ctaaaagaat tttctcgtgc ccaagacttg ggtggcctga ttgcccctaa ctttgccttg 360 
ggtgctgtct tactcatgca atttgcgacg caggctgcca aatatttccc aaatgtggag 420 
attattgagc tccatcatga caaga'aaaag gatgctccga gtggaacagc cattaaaaca 480 
gctgagttga tggcagaggt tcgagagtca attcagcaag gtgcagcaga tgaggaagag 540 
ctgattgctg gtgctcgtgg tgctgacttt gatggtatgc gcatccactc agttcgtttg 600 
ccaggcttgg tagctcatca ggaagtcatc tttggcaatc agggagaagg gttgaccctc 660 
cgtcatgact cctatgatcg catctccttc atgacaggag tcaatttggg aattaaagaa 720 
gttgtcaagc gtcatgagct tgtctatgga ttagaacact tattatga 768 

<210> 29 
<211> 276 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 29 

atggcaaaca aacaagattt gatcgctaaa gtagcagaag ctacagaatt aactaagaaa 60 
gactcagcag cagcagttga agctgtattt gcagcagtag ctgactatct tgcagctggt 120 
gaaaaagttc agttgatcgg ttttagtaac tttgaagttc gtgagcgcgc agaacgtaaa 180 
ggtcgcaacc cacaaactgg taaagaaatg acaattgcag cttctaaagt accagcattc 240 
aaagctggta aagc'tcttaa agacgctgtt aaataa 276 

<210> 30 
<211> 921 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 30 

atgactaaaa cagccttttt atttgctggt caaggtgccc agtatctagg gatgggacgg 60 
gatttctatg atcagtatcc gattgttaaa gaaacgattg atcgagcgag tcaggtgctc 120 
ggttatgatt tgcgttatct catcgatacg gaagaggaca aactcaatca gacccgctat 180 
acgcaaccag ccattctagc gacttcggtt gctatctacc gtttattgca agaaaagggc 240 
tatcagcctg atatggtcgc tggtttgtct cttggagaat actctgcctt ggtggcaagc 300 
ggcgccttgg attttgaaga tgcggttgcc ttggtagcta agcgtggagc ctatatggaa 360 
gaagcggctc.ctgctgactc tggcaagatg gtagcagttc tcaatacgcc agtagaggtc 420 
attgaagaag cctgtcaaaa agcttctgaa cttggagtgg ttactccagc caactataac 480 
acacctgcac aaatcgtcat tgctggagaa gtggttgcag ttgatcgagc ggttgaactt 540 
ttgcaagaag caggtgccaa acgcttgatt cctcttaagg tgtcaggtcc ctttcacacc 600 
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gctctccttg agccagctag ccagaaacta gctgaaactc tagctcaggt aagtttttca 660 
gattttactt gtcccctagt cggcaataca gaagctgctg tgatgcaaaa agaggacatt 720 
gctcagctct tgacgcgtca ggtcaaggaa cccgttcgtt tctatgaaag tattggggtc 780 
atgcaagaag caggcataag caactttatc gagattggac cggggaaagt cttgtcaggt 840 
tttgttaaaa aaattgatca aactgctcac ttagctcatg tggaagatca agcgagttta 900 
gtagcacttt tagaaaaata g 921 

<210> 31 
<211> 732 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 31 

atgaaactag aacataaaaa tatctttatt acaggttcga gtcgtggaat tggtcttgcc 60 
atcgcccaca agtttgctca agcaggagcc aacattgtct taaacagtcg tggggcaatc 120 
tcagaagaat tgctcgctga gttttcaaac tatggtatca aggtggttcc catttcagga 180 
gatgtatcag attttgcaga cgctaagcgt atgattgatc aagctattgc agaactgggt 240 
tcagtagatg ttttggtcaa caatgcaggg attacccaag atactcttat gctcaagatg 300- 
acagaagcag attttgaaaa agtgctcaag gtcaatctga ctggtgcctt taatatgaca 360 
caatcagtct tgaaaccgat gatgaaagcc agagaaggtg ctatcattaa tatgtctagt 420 
gttgttggtt tgatggggaa tattggtcaa gctaactatg ctgcttctaa ggctggcttg 480 
attggcttta ccaagtctgt ggcacgcgag gtcgctagtc ggaatatacg agtcaatgtg 540 
attgctccag gaatgattga gtctgatatg accgctatct tatcagataa gattaaggaa 600 
gctacactag ctcagattcc gatgaaagaa tttgggcagg cagagcaggt tgcagatttg 660 
acagtatttt tagcaggcca agattatcta actggtcaag- tggttgccat tgatggtggc 720 
ttaagtatgt ag yy 732 

<210> 32 
<211> 831 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 32 

atgggagtga aaaagaaact aaagttgact agtttgctag gactgtctct gttaatcatg 60 
acagcctgtg cgactaatgg ggtaactagc gatattacag ccgaatcggc tgatttttgg 120 
agtaaattgg tttacttctt tgcggaaatc attcgctttt tatcgtttga tattagtatc 180 
ggagtgggga ttattctctt tacggtcttg. attcgtacag tcctcttgcc agtctttcag 240 
gtgcaaatgg tggcttctag gaaaatgcag gaagctcagc cacgcattaa ggcgcttcga 300 
gaacaatatc caggtcgaga tatggaaagc agaaccaaac tagagcagga aatgcgtaaa 360 
gtatttaaag aaatgggtgt cagacagtca gactctcttt ggccgatttt gattcagatg 420 
ccggttattt tggccctgtt ccaagcccta tcaagagttg actttttaaa gacaggtcat 480 
ttcttatgga ttaaccttgg &agtgtggat acaacccttg ttcttccgat tttagcagca 540 
gtattcacct ttttaagtac ttggttgtcc aacaaagctt tgtctgagcg aaatggcgct 600 
acgactgcga tgatgtatgg gattccagtc ttgattttta tctttgcagt ttatgcgcca 660 
ggtggagtcg ccctatactg gacagtgtct aatgcttatc- aagtcttgca aacctatttc 720 ■ 
ttgaataatc cattcaagat tatcgcagag cgcgaggccg tagtacaggc acaaaaagat 780 
ttggaaaata gaaaaagaaa agccaagaaa aaggctcaga aaacgaaata a 831 

<210> 33 
<211> 1230 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 33 ' 

atgaagatta gtaagaggca cttattaaat tattccatct tgattcccta cttactttta 60 
tctattttgg gcttgattgt ggtctattcg accaccagtg ctattttaat tgaagaaggc 120 
aagagcgcct tgcagttggt tcgaaaccaa ggaatctttt ggattgttag tttgatactg 180 
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10 



15 



attgccttaa tttataaatt gagactagat tttttgagaa atgagcgact aatcatttta 240 
gttatattaa tagaaatgct tttattgttc ttggctcgtt ttattggtat ttcagtaaac 300 
ggggcatacg gttggatttc ggttgcagga gtaactattc agccagctga gtacttaaaa 360 
atcattatta tttggtattt agctcaccga ttctccaaac agcaagaaga aatagctact 420 
tatgattttc aagttttgac tcaaaatcaa tggcttcccc gtgcttttaa tgattggcga 480 
ttcgttctcc tagttctgat tggaagtttg ggaattttcc ctgatttagg aaatgcgact 540 
attttagtct tggtttcctt gattatgtat acagttagtg gaatcgctta tcgctggttt 600 
tcaaccattc tggcgctcgt atctgccact tctgtctttg tcttgaecac tatcagccta 660 
atcggtgttg agaccttttc aaaaattcca gtatttggct atgtagccaa gcgctttagt 720 
gcctttttta atccttttgc cgatcgtgct gatgcaggtc accagttagc taattcttat 780 
tttgccatgg tcaatggtgg ttggtttggt ctaggtcttg gaaactcgat tgaaaaacga 84 0 
ggttatttgc cagaagctca tacagacttt gtcttttcta tcgtgattga agaatttggc 900 
tttgttggtg ccagtcttat tttagctctc ttgtttttca tgattttgcg gattatcttg 960 
gtcggtatcc gagcggagaa tcctttcaat gccatggttg cactcggtgt cggagggatg 1020 
atgttggttc aggtatttgt caatatcgga gggatttcgg gcttgattcc atctacagga 1080 
gtgactttcc ctttcttatc ccagggtgga aatagtcttc tagtcttatc agtggcagta 1140 
gcctttgtct taaatattga tgccagtgaa aaacgcgcta agttgtaccg agaattggaa 1200 
aatcaaccaa tgaaccttct gttgaagtag 1230 

20 <210> 34 

<211> 1260 
<212> DNA 

<213> Streptococcus pneumoniae 
25 <400> 34 

atgctcggaa ttttaacctt tattctggtt tttgggatta ttgtagtggt gcacgagttc 60 
gggcacttct actttgccaa gaaatcaggg attttagtac gtgaatttgc catcggtatg 120 
ggacctaaaa tctttgctca cattggcaag gatggaacgg cctataccat tcgaatcttg 180 
cctctgggtg gctatgtccg catggccggt tggggtgatg atacaactga aatcaagaca 240 
iU ggaacgcctg ttagtttgac acttgctgat gatggtaagg ttaaacgcat caatctctca 300 
gcftaaaaaat tggatcaaac agccctccct atgcaggtga cccagtttga ttttgaagac 360 
aagctcttta tcaaaggatt ggttctggaa gaagaaaaaa catttgcagt ggatcacgat 420 
gcaacggttg tggaagcaga tggtactgag gttcggattg cacctttaga tgttcaatat 480 
caaaatgcga ctatctgggg caaactgatt accaattttg caggtcctat gaacaatttt 540 
35 atcttaggtg tcgttgtttt ttgggtttta atctttatgc agggtggtgt cagagatgtt 600 
gataccaatc agttccatat catgccccaa ggtgccttgg ccaaggtagg agtaccagaa 660 
acggcacaaa ttaccaagat cggctcacat gaggttagca actgggaaag cttgatccaa 720 
gctgtggaaa cagaaaccaa agataagacg gcaccgactt tggatgtgac tatttctgaa 780 
aaggggagtg acaaacaagt cactgttaca cccgaagata gtcaaggtcg ttaccttcta 8 40 
ggtgttcaac cgggggttaa gtcagatttt ctatccatgt ttgtaggtgg ttttacaact 900 
gctgctgact cagctctccg aattctctca gctctgaaaa atctgatttt ccaaccggat 960 
ttgaacaagt tgggtggacc tgttgctatc tttaaggcaa gtagtgatgc tgctaaaaat 1020 
ggaattgaga atatcttgta cttcttggca atgatttcca tcaatattgg gatttttaat 1080 
cttattccga ttccagcctt ggatggtggt aagattgtgc tcaatatcct agaagccatc 1140 
cgccgcaaac cattgaaaca agaaattgaa acctatgtca ccttggccgg agtggtcatc 1200 
atggttgtct tgatgattgc tgtgacttgg aatgacatta tgcgactctt ttttagataa 1260 

<210> 35 
<211> 594 
50 <212> DNA 

<213> Streptococcus pneumoniae 



40 



45 



<400> 35 

atgtacgcat atttaaaagg aatcattacc 

55 accaatggta ttggttatat cctgcatgtg 

aatcaggagg ctcagattta tgtgcatcag 

ggatttcgct cagaggatga gaaaaagctc 



aaaattactg ccaaatacat tgttcttgaa 60 
gccaatcctt atgcctattc aggtcaggtt 120 
gttgtgcgtg aggacgccca tttgctttat 180 
tttcttagtc taatttcggt ctctgggatt 240 
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ggtcctgtat.,cagctcttgc tattatcgct gctgatgaca atgctggctt ggttcaagcc 300 
attgaaacca agaacatcac ctacttgacc aagttcccta aaattggcaa gaaaacagcc 360 
cagcagatgg tgctggactt ggaaggcaag gtagtagttg caggagatga ccttcctgcc 420 
aaggtcgcag tgcaagcaag tgctgaaaac caagaattgg aagaagctat ggaagccatg " 480 
ttggctctgg gctacaaggc aacagagctc aagaaaatca agaaattctt tgaaggaacg 540 
acagatacag ctgagaacta tatcaagtcg gcccttaaaa tgttggtcaa atag 594 

<210> 36 
<211> 774 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 36 

atgaagaata atcgtatttt agcactttct ggaaatgata tttttagtgg tggtggactg 60 
tcagctgatt tggctaccta taccttgaac ggcttgcatg ggtttgtagc agtgacttgt 120 
ttgacagcct tgacagaaaa aggatttgaa gtctttccaa ctgatgatac catttttcaa 180 
catgaattag atagcttgcg tgatgtggaa tttgggggaa ttaagattgg tcttctccct 240 
actgtcagtg tggctgagaa ggccttggac tttatcaaac aacgcccagg agtacctgtg 300 
gtgttggatc ctgtcttggt ctgcaaggaa acgcatgatg tagctgtcag tgagctctgc 360 
caagagttga ttcgcttctt cccttatgtc agtgtgatta cgcctaatct cccagaagca 420 
gaattattat ccggtcagga aattaaaacc ttggaagaca tgaaaactgc agcgcagaaa 480 
ttgcatgatt taggagcgcc agcagtcatt atcaagggag gcaatcgtct tagtcaggac 54 0 
aaggctgtgg atgtctttta tgatggacag acctttacta tcctagaaaa tccagttatc 600 
caaggccaaa atgctggtgc aggttgtacc tttgcctcta gcattgccag tcacctggtt 660 
aaaggtgata aatttttgcc agcagtagaa agctctaagg ctttcgttta tcgtgctatt 720 
gcacaagcag atcagtatgg agtaagacaa tatgaagcaa acaaaaacaa ctaa 77 4 

<210> 37 
<211> 1239 
<212> DNA 

<213> Streptococcus pneumoniae - 
<400> 37 

atgattgaaa cggagaaaaa agaggagcga gtcctgctga ttggtgtgga attgcagggt 60 
atggacagtt ttgacctctc catggaagaa ttggctagtt tagcgaaaac ggcaggggca 120 
gtcgttgtag atagctacag acaaaaacgt gaaaaatatg attccaagac cttcgtcggc 180 
tctggtaagt tggaagagat tgcgcttatg gtggatgcag aagaaatcac tactgtcatc 24 0 
gtcaacaatc gtctgacccc aaggcagaat gtcaatctag aggaagttct cggtgttaag 300 
gtcattgacc gtatgcagtt gattttggat atctttgcca tgcgggctcg aagccatgaa 360 
gggaagctcc aagtccacct agcccaattc aaatacctct tgcctcgctt ggttggtcag 420 
gggattatgc tcagccgtca ggcaggggga attggttccc gtggtcctgg tgaaagccaa 480 
ctggagctga accgtcgtag cgttcgcaat caaatcacgg atatcgagcg ccagcttaag 540 
gtggttgaga aaaatcgtgc gactgtcaga gaaaaacgtt tggagtctag cacttttaag 600 
attggtttga ttggttatac taatgctggg aaatcaacta tcatgaacat cttgaccagt 660 
aagacccagt atgaagcaga tgagctcttt gcgactctgg atgcgacaac caagagtatt 720 
catctgggag gcaatctcca agtaactttg acagataccg ttggctttat ccaagatttg 780 
ccgacagagt tggtgtccag tttcaagtca accttggaag aaagcaagca tgtggacctt 840 
ctggttcatg ttatcgatgc tagcaatcct taccacgagg- agcatgaaaa aacggttctc 900 
tccatcatga aagacctgga catggaagat attcctcact tgacgcttta taataaagcg 960 
gatttggtgg aggatttcac gcctacccaa acgccatata ccctcatttc tgccaagtct 1020 
gaggacagtc gtgaaaactt gcaagcatta ttgctagata agattaagga aatttttgaa 1080 
gcatttaccc tgcgagtgcc tttttcaaag tcctacaaga ttcatgattt agagagtgtt 1140 
gcaattctgg aagaacgtga ttatcaggaa gacggcgaag tgattacagg ctacatttcc 1200 
gagaaaaata aatggaggtt agaagaattt tatgactga 12 39 

<210> 38 
<211> 483 
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<212> DNA 

<213> Streptococcus pneumoniae 



<400> 38 

atggcagaaa aaacatatcc tatgaccctt gaggaaaagg aaaaacttga aaaagaatta 60 
gaagaattga aattggttcg tcgaccagaa gtggtagaac gcattaagat tgcccgttca 120 
tacggtgacc tttcagaaaa cagtgagtac gaagcagcta' aggatgaaca agcctttgtc 180 
gaaggacaaa tctctagctt agaaacaaaa atccgctatg ctgaaatcgt caatagcgac 240 
gcagttgccc aggacgaagt agcgattggt aaaacagtca ccatccaaga aattggtgag 300 
gacgaagaag aagtttatat tatcgtaggt tcagctggtg cggatgcctt tgcaggtaag 360 
gtttcaaatg aaagcccaat tgggcaggcc ttgattggca agaaaacagg tgatacagca 4 20 
accattgaaa cgcctgttgg tagctatgat gtaaaaatct tgaaggttga aaaaacagcc 480 
taa 483 

<210> 39 
<211> 570 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 39 

atg'accaaat tacttgtagg cttgggaaat ccaggggata aatattttga aacaaaacac 60 
aatgttggtt ttatgttgat tgatcaacta gcgaagaaac agaatgtcac ttttacacac 120 

gataagatat ttcaagctga cctagcatcc tttttcctaa atggagaaaa aatttatctg 180 

gttaaaccaa cgacctttat gaatgaaagt ggaaaagcag ttcatgcttt attaacttac 240 

tatggtttgg atattgacga tttacttatc atttacgatg atcttgacat ggaagttggg 300 

aaaattcgtt taagagcaaa aggctcagca ggtggtcata atggtatcaa gtctattatt 360 

caacatatag gaactcaggt ctttaaccgt gttaagattg gaattggaag acctaaaaat 420 

ggtatgtcag ttgttcatca tgttttgagt aagtttgaca gggatgagta tatcggtatt 4 80 

ttacagtctg ttgacaaagt tgacgattct gtaaactact atttacaaga gaaaaatttt 540' 
gagaaaacaa tgcagaggta taacggataa 570 

<210> 40 
<211> 852 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 40 

atgattttaa ttacaggggc aaatggccaa 
gaacgtaatg aagaatacgt ggcagtagat 
atggttgaga aagtttttga agaggtgaaa 
accgctgttg atgcagcaga ggatgaagga 
gggacaaaaa atgtcgcaaa agcatctgaa 
acggactatg tctttgacgg taagaaacca 
ccagatccac agacagaata tggacgcact 
catgtgtcta atttctatat tatccgtact 
ttcgttttta ccatgcaaaa tcttgcgaaa 
cagtacggtc gtccgacttg gactcgtacc 
aatcgtaagg aatttggtta ttatcatttg 
tatgattttg cagttgaaat tttgaaagat 
agtcaatttc cagccaaagc taaacgtccg 
aaagctactg gatttgttat tccaacttgg 
gaagtgagat aa 

<210> 41 
<211> 1224 
<212> DNA 

<213> Streptococcus pneumoniae 



ttaggaacgg aacttcgcta tttattggat 60 
gtggctaaga tggacattac caatgaagaa 120 
ccgactttag tctaccattg tgcagcctac 180 
aaagagttgg acttcgccat caatgtgacg 240 
aagcatggtg caactctagt ttatatttct 300 
gttggacaag agtgggaagt tgatgaccga 360 
aagcgtatgg gggaagagtt agttgagaag 420 
gcctgggtat ttggaaatta tggcaaaaac 480 
actcataaga ctttaacagt tgtaaatgat 540 
ttggctgagt tcatgaccta cctagctgaa 600 
tcaaatgatg cgacagaaga cacaacatgg 660 
acagatgtcg aagtcaagcc agtagattcc 720 
ctaaactcaa cgatgagcct ggccaaagcc 780 
caagatgcat tgcaagaatt ttacaaacaa 840 

852 
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<400> 41 

atgaaacgtt ctctcgactc aagagtcgat tacagtttgc tcttgccagt attttttcta 60 
ctggtcatcg gtgtggtggc tatctatata gccgttagtc atgattaccc caataatatt 120 
ctgcccattt tagggcagca ggtcgcctgg attgccttgg ggcttgtgat tggttttgtg 180 
gtcatgctct ttaatacaga atttctttgg aaggtgaccc cctttctata tattttaqac 240 
ttgggactta tgatcttgcc gattgtattt tataatccaa gcttagttgc atcaacgggt 300 
gccaaaaact gggtatcaat aaatggaatt accctattcc aaccgtcaga atttatgaag 360 
atatcctata tcctcatgtt ggctcgtgtc attgtccaat ttacaaagaa acataaggaa 420 
tggagacgca cggttccgct ggactttttg ttaattttct ggatgattct ctttaccatt 480 
ccagtcctag ttcttttagc acttcaaagt gacttgggga cggetttggt ttttgtagcc 540 
attttctcag gaatcgtttt attatcaggg gtttcttgga aaattattat cccagtattt 600 
gtgactgctg taacaggagt • tgctggtttc ttagctatct ttattagcaa ggacggacga 660 
_ gcttttcttc accagattgg aatgccgacc taccaaatca atcggatttt ggcttggctc 720 
ID aatccctttg agtttgccca aacaacgact taccagcagg ctcaagggca gattgccatt 780 
gggagtggtg gcttatttgg tcagggattt aatgcttcga atctgcttat cccagttcga 840 
gagtcagata tgatttttac ggttattgca gaagattttg gctttattgg ctctgtcctg 900 
gttattgccc tctatctcat gttgatttac cgtatgttga agattactct taaatcaaat 960 
aaccagttct acacttatat ttccacaggt ttgattatga tgttgctctt ccacatcttt 1020 
IV gagaatatcg gtgctgtgac tggactactt cctttgacgg ggattccctt gcctttcatt 1080 
tcgcaagggg gatcagctat tatcagtaat ctgattggtg ttggtttgct tttatcgatg 1140 
• agttaccaga ctaatctagc tgaagaaaag agcggaaaag ttccattcaa acggaaaaaq 1200 
gttgtattaa aacaaattaa ataa y 1224 

25 <210> 42 
<211> 609 
<212> DNA 

<213> Streptococcus pneumoniae 
30 <400> 42 

atgggaaaaa tcatcggaat cactggggga attgcctcag gtaagtcaac tgtgacaaat 60 
tttctaaaac accaagggct ttcaagcagt ggattgccga cgcagtgttc "caccaactac 120 
agaaaacctg gtggtcgtct gtttgaggct ttagtacagc actttgggca agaaatcatt 180 
cttgaaaacg gagaactcaa tcgccctctc atagctagtc tcatcttttc aaatcctgaa 240 
gagcaaaaat ggtctaatca aattcaaggg gagattatcc gtgaggaact ggctactttg 300 
agagaacagt tggctcagac agaagagatt ttcttcatgg atattcccct actttttgaa 360 
caggactaca gcgattggtt tgctgagact tggttggtct atgtggaccg agatgcccaa 420 
gtagaacgct taatgaaaag ggaccagttg tccaaagatg aagctgagtc tcgtatggca 480 
gcccagtggc ctttagaaaa aaagaaagat ttggccagcc aggttcttga taataatggc 54 0 
aatcagaacc agcttcttaa tcaagtgcat atccttcttg agggaggtag gcaagatgac 600 
agagattaa 609 



40 



<210> 43 
<211> 1260 
45 <212> DNA 

<213> Streptococcus pneumoniae 



. <400> 43 
atgagaaaaa ttgttatcaa tggtggatta 

50 gctaaaaata gtgttgtggc cttaattcca 
ttggattgtg ttccagatat ttcggatgta 
ggagctactg ttaagcgtta tgacgatgtc 
attccaatgc cttatggtaa aattaacagt 
ctcttaggcc gttttggtga agcgacagtt 

55 cgtccgattg acttacacct taaggcgttt 
ggagataaca tgaagttatc tgctaaagat 
gatacggtta gtgtgggagc aacgattaat 



ccactgcaag gtgaaattac tattagtggt 60 
gctattatat tggctgatga tgtggtgact 120 
gccagtcttg tcgaaatcat ggaattgatg 180 
ttggagattg atccaagagg tgttcaaaat 240 
cttcgtgcat cttactattt ttatgggagc 300 
ggtctaccgg gaggatgtga tcttggtcct 360 
gaagctatgg gtgccactgc tagctacgag 420 
acaggacttc atggtgcaag tatttacatg 480 
acgatgattg ctgcagttaa agcaaatggt 540 
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cgtactatta ttgaaaatgc agcccgtgaa cctgagatta ttgatgtagc tactctcttg 600 
aataatatgg gtgcccatat ccgtggggca ggaactaata tcatcattat tgatggtgtt 660 
gaaagattac atgggacacg tcatcaggtg attccagacc gcattgaagc tggaacatat 720 
atatctttag ctgctgcagt tggtaaagga attcgtataa ataatgttct ttacgaacac 780 
ctggaagggt ttattgctaa gttggaagaa atgggagtga gaatgactgt atctgaagac 840 
agcatttttg tcgaggaaca gtctaatttg aaagcaatca atattaagac agctccttac 900 
ccaggctttg caactgattt gcaacaaccg cttacccctc ttttactaag agcgaatggt 960 
cgtggtacaa ttgtcgatac gatttacgaa aaacgtgtaa atcatgtttt tgaactagca 1020 
aagatggatg cggatatttc gacaacaaat ggtcatattt tgtacacggg tggacgtgat 1080 
ttacgtgggg ccagtgttaa agcgaccgac ttaagagctg gggctgcact agtcattgct 1140 
gggcttatgg ctgaaggtaa aactgaaatt accaatattg agtttatctt acgtggttat 1200 
tctgatatta tcgaaaaatt acgtaattta ggagcggata ttagacttgt tgaggattaa 1260 

<210> 44 
<211> 696 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 44 

atgtcaagaa ttgaattttc accatctttg atgaccatgg atttggacaa attcaaagag 60 
cagattactt ttttgaatga taaagtagca tcttatcata tcgatattat ggatggccat 120 
tttgttccca atattacctt gtctccttgg ttcattcaag aagttcaaaa aattagtgac 180 
acacctttat cagttcatct gatggtcaca gacccaacct tttgggtaga tcaagttctc 240 
gatttacaat gtgagtatat ttgtattcat gctgaagttc tgaatggtct tgcttttcgt 300 
ttgattgata aaattcatga tgcaggtcta aaggctggtg ttgtccttaa tcctgaaaca 360 
cctgtttcta caatctttcc ctacattgat ttacttgaca aagtaactat tatgactgta 4 20 
gatccaggtt ttgcaggaca acgctttttg gagtctacct tgtataaaat ccaagaactc 480 
cgtcagctta gagttcagaa tggttatcac tacatcattg agatggatgg ttcttcgagt 540 
cgtaagactt tcaaacaaat tgatgtggca ggaccagata tttatgttat aggtcgcagt 600 
ggattatttg gtttggatga cgatattgcc aaagcctggg atatctgttc tagagattac 660 
gaagaaatga ccggaaaaac aatgccaatc aaataa 695 

<210> 45 
<2il> 1125 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 45 

atgagaaata tggctttgac agcaggtatc gttggtttgc caaacgttgg taaatcaaca 60 
ctatttaatg caattacaaa agcaggagca gaggcagcaa actacccatt tgcgactatt 120 
gatccaaatg ttggaatggt ggaagatcca gatgaacgcc tacaaaaact aactgaaatg 180 
ataactccta aaaagacagt tcccacaaca tttgaattta cggatattgc agggattgta 240 
aaaggagctt caaaaggaga agggctaggg aataaattct tggccaatat tcgtgaagta 300 
gatgcgattg ttcacgtagt tcgtgctttt gatgatgaaa atgtgatgcg cgagcaagga 360 
cgtgaagacg cctttgtaga tccacttgca gatattgata caattaatct ggaattaatt 420 
cttgctgact tagaatcagt gaacaaacga tatgcgcgtg tagaaaagat ggcacgtacg 480 
caaaaagata aagaatcagt agcagaattc aatgttcttc aaaagattaa accagtccta 540 
gaagacggga aatcagctcg taccattgaa tttacagatg aggaacaaaa ggttgtcaaa 600 
ggtcttttcc ttttgacgac taaaccagtt ctttatgtag ctaatgtgga cgaggatgtg 660 
gtttcagaac ctgactctat cgactatgtc aaacaaattc gtgaatttgc agcgacagaa 720 
aatgctgaag tagtcgttat ttctgcgcgt gctgaggaag aaatttctga attggatgat 780 
gaagataaaa aagagtttct tgaagccatt ggtttgacag aatcaggtgt agataagttg 840 
acgcgtgcag cttaccactt gcttggattg ggaacttact tcacagctgg tgaaaaagaa 900 
gttcgcgctt ggactttcaa acgtggtatg aaggctcctc aagcagctgg tattatccac 960 
tcagactttg aaaaaggctt tattcgtgca gtaaccatgt catatgaaga tctagtgaaa 1020 
tacggatctg aaaaggccgt aaaagaagct ggacgcttgc gtgaagaagg aaaagaatat 1080 
atcgttcaag atggcgatat catggaattc cgctttaatg tctaa 1125 
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<210> 46 
<211> 333 
<212> DNA 
. 5 <213> Streptococcus pneumoniae 

<400> 46 

atggaaatcg aaaaaaccaa tcgtatgaat gcgctctttg aattttatgc ggcgcttttg 60 

acagataagc aaatgaatta tatagagctt tactacgctg atgattacag tcttgctgag 120 

10 atagctgagg agtttggtgt tagtcgtcag gctgtctatg acaatatcaa gcgaacagaa 180 

aagattctgg aagattatga gatgaaattg cacatgtact cggactacat tgtccgtagt 24 0 

cagatttttg accaaatctt ggagcgctat cccaaggatg attttctgca ggagcagata 300 
gaaattttaa caagcattga taatagagaa taa 333 

15 <210> 47 
<211> 672 
<212> DNA 

<213> Streptococcus pneumoniae 
20 <400> 47 

atgaccttag aatgggaaga atttctagat ccttacattc aagctgttgg tgagttaaag 60 
attaaacttc gtggtattcg taagcaatat cgtaagcaaa ataagcattc tccaattgag 120 
tttgtgaccg gtcgagtcaa gccaattgag agcatcaaag aaaaaatggc tcgtcgtggc 180 
attacttatg cgaccttgga acacgatttg caggatattg ctggcttacg tgtgatggtt 24 0 
25 cagtttgtag atgacgtcaa ggaagtagtg gatattttgc acaagcgtca ggatatgcga 300 
atcatacagg agcgagatta cattactcat agaaaagcat caggctatcg ttcctatcat 360 
gtggtagtag aatatacggt tgataccatc aatggagcta agactatttt ggcagaaatt 420 
caaattcgta ctttggccat gaatttctgg gcaacgatag aacattctct caactacaag 480 
taccaagggg atttcccaga tgagattaag aagcgactgg aaattacagc tagaatcgcc 540 
catcagttgg atgaagaaat gggtgaaatt cgtgatgata tccaagaagc ccaggcactt 600 
tttgatcctt tgagtagaaa attaaatgac ggtgtaggaa acagtgacga tacagatgaa 660 
gaatacaggt aa * ^72 



30 



40 



45 



50 



55 



<210> 48 
35 <211> 588 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 48 

atggaactta atacacacaa tgctgaaatc ttgctcagtg cagctaataa gtcccactat 60 

ccgcaggatg aactgccaga gattgcccta gcagggcgtt caaatgttgg taaatccagc 120 

tttatcaaca ctatgttgaa ccgtaagaat ctcgctcgta catcaggaaa acctggtaaa 180 

acccagctcc tgaacttttt taacattgat gacaagatgc gctttgtgga tgtgcctggt 240 

tatggctatg ctcgtgtttc taaaaaggaa cgtgaaaagt gggggtgcat gattgaggag 300 

tacttaacga ctcgggaaaa tctccgtgcg gttgtcagtc tagttgacct tcgtcatgac 360 

ccgtcagcag atgatgtgca gatgtacgaa tttctcaagt attatgagat tccagtcatc 420 

attgtggcga ccaaggcgga caagattcct cgtggtaaat ggaacaagca tgaatcagca 480 

atcaaaaaga aattaaactt tgacccaagt gacgatttca tcctcttttc atctgtcagc 540 

aaggcaggga tggatgaggc ttgggatgca atcttagaaa aattgtga 588 

<210> 49 
<211> 294 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 49 

atgaaaacaa gaaaaatccc tttgcgcaag tctgttgtgt ctaacgaagt gattgataag 60 
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cgtgatttgc tccgcattgt taagaacaag gaaggacaag tctttattga tcctacgggc 120 

aaggccaatg gccgcggcgc ttatatcaaa ctagacaatg cagaagccct agaggcgaaa 180 

aagaagaagg tctttaaccg cagctttagc atggaagtgg aagaaagctt ttatgacgag 240 

^ ttgatcgctt atgtggatca caaagtgaaa agaagagagt tgggacttga ataa 294 

<210> 50 
<211> 312 
<212> DNA 

<213> Streptococcus pneumoniae 

10 

<400> 50 

atgttaaaac cctctattga taccttgctc gacaaggttc cttcaaaata ttcactcgta 60 
atcttggaag caaaacgtgc ccacgaattg gaagcaggtg ccccagcaac tcaaggtttc 120 
aagtctgaaa aatcaactct tcgcgcttta gaagaaatcg aatcaggaaa cgttacaatt 180 
15 cacccagatc cagaaggaaa acgtgaagca gtgcgtcgcc gtatcgaaga agaaaaacgc 240 
cgcaaagaag aagaagaaaa gaaaatcaaa gagcaaattg ctaaagaaaa agaagatggt 300 
gaaaaaattt aa 312 

<210> 51 
20 <211> 312 
<212> DNft 

<213> Streptococcus pneumoniae 
<400> 51 

25 atgtcattaa catcaaaaca acgtgccttc ctcaacagcc aggcacacac cctcaaacct 60 
atcatccaaa tcgggaaaaa tggactcaac gaccaaatca aaaccagcgt ccgtcaagct 120 
cttgatgcgc gtgaattaat caaggttact ctcttacaaa acacagatga aaacatccac 180 
gaagtagctg aaattttgga agaagaaatc ggtgtggata cagtccaaaa aataggacgc 240 
atcttgattt tgtttaaaca atctagcaag aaagaaaatc gcaagatttc taagaaagtc 300 

30 aaagaaatct aa 312 

<210> 52 
<211> 528 
<212> DNA 
35 <213> Streptococcus pneumoniae 



<400> 52 

atggcgattg aaaattatat accagatttt 
ccaagcctgc aggcgcaggg aatcaaggct 

40 gcttggaaca accctgatgg aacgccagag 
gcgggtattg gcattatcgt agtgtcaaat 
gagaaatttg ggattgatta cgtttactgg 
cgtgctatga aggaattcca ctatgacaaa 
atgacagata tacgagcagc ccaccgtgca 

45 gtccaacatg actcaatcaa aacgcagatt 
aaaatcactg aaaagtacgg accgattaca 

<210> 53 
<211> 1368 
50 <212> DNA 

<213> Streptococcus pneumoniae 



gctgtggaag cagtctatga tctgacagtc 60 
gttttggtcg atttggataa taccctcatt 120 
atgaagcaat ggctacatga ccttcgggac 180 
aacaccaaaa aacgcgttca acgagcagtt 240 
gccttgaagc ccttcacatt tggtattgac 300 
aaggaagtgg tcatggttgg tgaccagctc 360 
gggattcggt caattttagt caaacccttg 420 
aaccgaactc gtgagcgtcg tgttatgaga 4 80 
tataaaaaag gaatttaa 528 



<400> 53 

atgtttcgaa aaattttaat tgccaatcgt 
55 gcacgtgaat tggggattgc gacggtagcg 
catacgctgt tggcagatga agcagtttgt 
ctcaatatta atgcagttct atcagctgca 



ggtgaaattg cggttcgtat tatccgtgcg 60 
gtttattcaa ctgctgataa ggaagctctt 120 
attggtcctg gcaaggcaac agagtcttat 180 
gtcttgactg aggcagaagc tattcaccct 240 
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ggttttggat ttctcagtga aaattccaaa tttgcgacca tgtgtgaaga aataggtatc 300 
aagtttatcg gtccatctgg tcatgttatg gatatgatgg gggataaaat caatgcgcgt 360 
gctcagatga ttaaagcagg tgtgcctgtt ataccaggtt cagatggaga agtgcataac 420 
tctgaagaag ctttgattgt tgctgaaaaa attggctatc ctgttatgct caaggcttca 480 
gcaggtggag gtggtaaagg gattcgtaag gttgaaaaac cagatgacct cgtttctgcc 540 
tttgaaactg cctctagtga ggccaaggcc aattatggca atggtgccat gtacatagaa 600 
cgggttatct atccagctcg gcacattgag gttcaaatcc taggtgatga gcatggacat 660 
gtgattcact tgggtgaacg ggattgttct cttcaaagga ataaccaaaa ggttttggaa 720 
gaaagtccct cgattgcaat cggaaaaacg ctgcgtcatg aaataggtgc tgctgctgtt 780 
cgagcggcag agtttgttgg ctatgagaat gcaggaacca ttgaatttct tcttgatgaa 840 
gcaagtagca atttctattt catggagatg aatactcgtg ttcaggtaga acatccagta 900 
acagagtttg tttcaggtgt tgatatcgtt aaggaacaga tttgcattgc ggcaggtcag 960 
cctttgtctg ttaagcaaga agatattgtc ctacgcggtc atgccatcga gtgtcgtatc 1020 
aatgcagaaa acccagcctt taactttgct ccaagtccag gtaagattac taatctctat 1080 
ctgccaagtg gtggagttgg cttgcgcgtg gattcagcag tttatccagg ttataccatt 1140 
ccgccttatt atgatagtat gattgccaaa atcatagtac acggcgaaaa tcgttttgac 1200 
gccttgatga aaatgcaacg tgccctctat gaattagaaa ttgaaggagt gcagaccaat 1260 
gcagatttcc agcttgacct catttcagat cgcaatgtca ttgctgggga ttatgatact 1320 
tgcttcttga tggaaacctt cttacctaaa tatcaagaaa aagaataa 1368 

<210> 54 
<211> 234 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 54 

atgatttaca aagtttttta tcaagaaaca aaagaacgta gcccacgccg tgaaacaaca 60 
cgcgcgcttt acctagacat cgataccagc tcagaacttg agggccgtat cactgctcgc 120 
caacttgtcg aagaaaatcg cccagagtac aatatcgaat atatcgaact cttgtctgac 180 
aaattgctcg attacgaaaa agaaactggc gccttcgaaa ttacggagtt ctaa 234 

<210> 55 - 
<211>. 1011 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 55 

atgaaggata gatatatttt agcatttgag acatcctgtg atgagaccag tgtcgccgtc 60 
ttgaaaaacg acgatgagct cttgtccaat gtcattgcta gtcaaattga gagtcacaaa 120 
cgttttggtg gcgtagtgcc cgaagtagcc agtcgtcacc atgtcgaggt cattacagcc 180 
-tgtatcgagg aggcattggc agaagcaggg attaccgaag aggacgtgac agctgttgcg 240 
gttacctacg gaccaggctt ggtcggagcc ttgctagttg gtttgtcagc tgccaaggcc 300 
tttgcttggg ctcacggact tccactgatt cctgttaatc acatggctgg gcacctcatg 360 
gcagctcaga gtgtggagcc tttggagttt cccttgctag ccctcttggt cagcggcgga 420 
cacacagagt tggtctatgt ttctgaggct ggcgattaca agattgttgg ggagacacga 480 
gatgacgcgg ttggcgaggc ctatgataag gtcggccgtg tcatgggctt gacctatcct 54 0 
gcaggtcgtg agattgacga gctggctcat caggggcagg atatttatga tttcccccgt 600 
gccatgatta aggaagataa tctggagttt tcattctctg gtttgaagtc agcctttatc 660 
aatcttcatc acaatgccga gcaaaaggga gaaagcctgt ctacagaaga tttgtgtgct 720 
tccttccaag cagcagttat ggacattctc atggcaaaaa ccaagaaggc tttggaggaa 780 
tatcctgtta aaaccctatt tgtggcaggt ggtgtggcag ccaataaagg tctcagagaa 840 
cgcttagcag ccgaaatcac agatgtcaag gttatcatcc cccctctgcg actctgcgga 900 
gacaatgcag gtatgattgc ctatgccagc gtcagcgagt ggaacaaaga aaacttcgca 960 
ggctgggacc tcaatgccaa accaagtctt gcctttgata ccatggaata a 1011 

<210> 56 
<211> 1809 
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10 



15 



20 



<212> DNA 

213> Streptococcus pneumoniae 
<400> 56 

atgtgtggaa ttgttggtgt tgttggaaac acaaatgcaa ctgatatttt gattcaaggg 60 
cttgaaaagc ttgaataccg tggctatgat tctgcgggaa tttttgtcct agatggtgct 120 
gataaccatt tggtgaaggc ggttggtcgt attgcagaat tgtctgccaa gacagctggt 180 
gttgagggaa caactggtat cggacatact cgttgggcta ctcacggaaa accaactgag 24 0 
gacaatgctc acccacaccg ctctgagaca gaacgttttg tcttggtgca taatggggtg 300 
attgagaact atcttgaaat caaggaagaa taccttgcag gtcaccactt caaggggcag 360 
acagatactg aaatagccgt tcatttgatt ggaaaatttg cggaagaaga agggctctca 420 
gttcttgaag cctttaaaaa agctcttcat attatccgtg gttcatatgc ctttgccttg 480 
attgactctg aaaatccaga tgtcatctat gtagcgaaaa acaaatctcc acttttgatt 540 
ggtcttgggg aaggctacaa tatggtctgc tcagatgcta tggctatgat tcgtgaaacc 600 
aaccaataca tggaaattca tgaccaagag ttggtaatcg tcaaggctga tagcgtggaa 660 
gttcaagact atgatggtaa cagtcgtgaa cgtgctagct atactgcgga acttgacttg 720 
tcagatatcg gtaagggaac ttatccttac tacatgctta aggaaattga tgagcaacca 780 
actgttatgc gtaaactcat tcaagcctac acggatgatg ctggtcaagt agtggttgct 84 0 
cctgctatca ttaaggctgt tcaagacgca gaccgcatct acatccttgc agctggaaca 900 
tcttaccatg caggatttgc ttctaagaaa atgttggaag aattgacaga tacaccagtt 960 
gaacttggaa tctcatctga gtggggctac ggtatgccac ttctcagcaa gaaaccactc 1020 
ttcatcttta tcagccaatc tggtgaaaca gcggatagtc gtcaagtttt ggtcaaggct 1080 
aatgaaatgg gaattccaag cttaacagtg acaaatgttc caggttcaac cctctcacgt 1140 
gaagccaact ataccatgct ccttcacgca ggacctgaaa ttgccgtggc atcaactaaa 1200 
IS gcctatacag cgcaaatcgc agcccttgcc ttccttgcaa aagcagtcgg agaagcaaat 1260 
ggtaatgcta aagcgcaagc ctttgacctg gttcatgaat tgtcaatcgt agctcagtct 1320 
attgaatcaa ctctttcaga gaaagaaacc attgaagcca aggttcgtga acttcftgaa 1380 
acaactcgta acgcctttta catcggacgt ggtcaagatt actacgtagc catggaagca 1440 
agtctcaaac tcaaagagat ttcttatatc cagtgtgaag gttttgcggc aggagaactc 1500 
aagcacggaa ccattgcctt gattgaagaa ggaacgcctg ttttggctct cttgtcagat 1560 
ccagttcttg ccaaccatac tcgtggaaat atccaagagg tcgcagcccg tggtgccaaa 1620 
gtcctcacta tcgcagaaga gaatgtagcc aaagataccg acgatatcgt ccttacgacc 1680 
gtacatccat acctctcacc aatttcaatg gtcgtaccaa cgcaattagt cgcttacttt 1740 
gcaaccctcc accgtggcct cgatgtggac aaaccacgta accttgccaa gtcagtaacg 1800 
J> gtagaataa 180g 

<210> 57 
<211> 723 
<212> DNA 

<213> Streptococcus pneumoniae 



30 



40 



<400> 57 

atgatacgta tcgaaaatct cagtgtctcc tacaaagaaa cgttggcact taaggatatt 60 
tcactagtgc tccatggacc aacaattacc ggcatcattg gtccaaacgg cgctgggaaa 120 
tcaacactat taaaaggtat gctgggaatt atcccacatc aaggtcaggc atttctcgat 180 
gacaaggaag ttaaaaaatc cttacaccga attgcctatg tcgaacaaaa aatcaatatc 240 
gactacaact ttcccatcaa ggtcaaggaa tgcgtctcgt taggactatt tccctctatt 300 
cctctctttc gaagtttaaa ggctaaacat tggaagaaag tgcaagaggc ccttgaaatc 360 
gtcggcctag ctgactacgc tgaacgtcaa attagtcaac tgtctggagg tcaattccag 420 
cgggtcttga ttgccagatg tttggtgcag gaagccgact atatcctctt ggatgaaccc 480 
tttgctggga ttgactctgt cagtgaggaa atcatcatga atacgctgag agatttgaaa 54 0 
aaagctggga agacggttct catcgttcac cacgacctca gcaagattcc ccactacttc 600 
gatcaagtct tacttgtcaa tcgagaagtg attgcctttg gtccaacaaa agaaactttt 660 
accgaaacca atctaaaaga agcttacggt aatcaactct ttttcaatgg aggtgaccta 720 
55 tga ?23 

<210> 58 
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<211> 2223 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 58 

atgccgaaag aagtgaattt aacaggcgaa gaagttgtcg ctttaaccaa agaatattta 60 
acggaagagg atgttcattt tgtccataag gccttggtct atgctgttga atgccacagt 120 
ggtcaatatc gcaaatcagg cgagccttat atcattcacc ctatccaagt ggcaggtatt 180 
ttagctaagc taaagctgga tgctgtaaca gtagcttgtg gattcttgca tgatgtggtg 240 
gaagatacag atgcgacctt ggacgatttg gaaagagagt ttggtcctga tgtgcgggtg 300 
attgttgacg gagttaccaa gcttggcaag gtcgagtaca aatcgatcga ggagcaatta 360 
gcggaaaatc atcgcaagat gctcatggcc atgtctgagg acatccgcgt tattttggtc 420 
aaactgtctg accgcttgca caatatgc'gg accctgaaac atcttcgaaa agacaagcag 480 
gagcgtattt ccaaagaaac catggaaatc tatgccccac ttgcccatcg tttggggatt 540 
tccagtgtca aatgggaatt agaagacttg tctttccgtt atctcaatcc aacggagttt 600 
tacaagatta cccatatgat gaaggaaaag cgcagggagc gtgaggcctt ggtggatgag 660 
gtagtcacaa aattagagga gtatacgaca gaacgtcact tgaaagggaa gatttatggt 720 
cgtcccaagc atatttactc aattttccgc aaaatgcagg acaagagaaa acggtttgag 780 
gaaatctatg atctgattgc tattcgttgt attttagata cccaaagtga tgtttatgcc 840 
atgcttggtt acgtgcatga attttggaaa ccgatgccag gtcgcttcaa agactatatt 900 
gccaaccgca aggccaatgg ttatcagtct atccatacga ctgtttatgg accaaaaggg 960 
ccgattgaat tccagattcg aaccaaggaa atgcacgagg tggctgagta cggggttgcg 1020 
gctcactggg cttataagaa aggtataaag gggcaagtta acagcaagga atcagct'att 1080 
ggaatgaact ggatcaagga gatgatggag ctccaagacc aggctgatga tgctaaggaa 1140 
tttgtggact ctgttaagga aaactatctg gctgaggaga tttacgtttt taccccagat 1200 
ggagctgtcc gttcccttcc caaagattca ggaccgattg attttgccta cgaaatccat 1260 
accaaggtcg gtgaaaaagc aactggtgcc aaggtcaatg gccgcatggt tccactgaca 1320 
accaagttaa agacagggga tcaggttgaa attatcgcca acccgaactc ctttggacct 1380 
agccgtgact ggctcaatat ggtcaagact agcaaggcgc gcaataagat tcgccagttc 14 40 
tttaaaaacc aagataagga attgtctgtc aacaagggtc gtgagatgct gatggctcag 1500 
ttccaagaaa atggctatgt ggcaaataaa tttatggaca agcgccacat ggatcaagtt 1560 
ctgcaaaaga ccagttacaa gacagaagac tccctctttg cggccattgg ttttggggaa 1620 
atcggtgcga ttaccgtctt taaccgtctg actgaaaagg aacgccgtga ggaagagcgt 1680 
gccaaggcca aggctgaggc agaggagctt gtcaaaggtg gcgaggtcaa ggttgaaaat 1740 
aaagaaactc tcaaggtcaa gcatgagggg ggagtggtta ttgaaggtgc ttctggtctc 1800 
ctagtgcgga ttgctaagtg ttgtaacccc gtgcctggtg acgatattgt tggctacatt. 1860 
accaagggtc gtggtgtggc tattcaccgt gtggactgta tgaacctgcg tgcccaagaa 1920 
aactacgagc aacgtctcct tgatgtggaa tgggaagacc agtactctag ctcaaataag 1980 
gagtatctgg cccatatcga tatctacggt ctcaaccgta caggactgtt gaacgatgta 2040 
ctgcaagttc tttcaaatac aaccaagaat atttcaacgg tcaatgccca accaaccaag 2100 
gatatgaagt ttgctaatat ccatgtgtcc ttcggtattg ccaacctctc tacactgacc 2160 
acggttgtcg ataaaattaa gagtgtgcca gaagtttact ctgtcaaacg gaccaacggc 2220 
ta 9 2223 

<210> 59 . 
<211> 1479 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 59 

atgtctaatt gggacactaa atttttgaaa aaaggtttta cctttgatga tgtattgctt 60 
attccagctg aaagtcatgt gttgcctaac gatgcagatt taacaactaa attggcagat 120 
aatttgactt taaatatccc aattattacc gctgccatgg acacagttac agagagtcaa 180 
atggccattg ctattgctcg tgcaggcggt ctcggagtta tccataaaaa catgtcaatt 24 0 
gctcaacaag cagacgaggt tcgtaaggta aaacgttctg aaaatggagt tattattgat 300 
ccgttcttct tgacgcctga acatacaatt gctgaagcag atgagcttat gggtcgttac 360 
cgcatcagtg gtgttccagt tgttgaaaca cttgaaaatc gtaaattggt tggtattttg 420 
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acaaaccgag atcttcgttt tatttcagat 
agtgaaaatc ttgttactgc tcctgtgggt 
caagagcatc gtattgaaaa acttccgttg 
atcactatca aagatattga aaaagttatt 
5 ggtcgtcttc tagtt'gcagg tgcagtaggt 
gctctttttg aggcaggagc ggatgcgatt 
ggtgtcttgc gtaaaattgc cgagattcgt 
ggaaatattg ctactgctga aggtgcacgt 
aaggttggta ttggaccagg ttctatctgt 

10 ccgcaagtaa cagctatcta cgatgctgca 
attgctgacg gtgggatcaa gtattctgga 
aatgctgtta tgcttggatc tatgtttgct 
atcttccaag gacgtaaatt caagacttac 
aaaggttcaa gcgaccgtta tttccaaggt 

15 gaaggaattg aaggtcgtgt tgcttataaa 
attggtggta ttcgctctgg tatgggttac 
gataatgctc aatttattga aatgtctggt 
gtgcaaatta ctaatgaggc accaaattat 

20 <210> 60 

<211> 1947 
<212> DNA 

<213> Streptococcus pneumoniae 

25 <400> 60 

atgacagaag aaatcaaaaa tctgcaggca 
ttagagggct tagaggctgt tcgtatgcgt 
gaaggtcttc accatctagt ctgggaaatt 
ggatttgcca gccatattca agtttttatt 
30 gatgggcgtg gtatcccagt cgatattcag 
gtctttacag tccttcacgc tggaggaaag 
ggtcttcacg gggtggggtc gtcagtagtt 
gttcacaaaa atggtaagat tcattaccaa 
cttgaaatag ttggagatac ggataaaaca 
35 aaaatcttca ctgaaacaac aatetttgat 
ttggcctttc taaatcgcgg tcttcaaatt 
caaaccaagc attatcatta tgaaggtggg 
aacaaggatg taatctttga tacaccaatc 
gttgaggtag ccatgcagta cacaactggt 
40 aatattcata cacatgaagg tggaacgcat 
gttatcaatg attatgctcg taaaaataag 
ggggaagatg ttcgcgaagg cttaactgca 
tttgaaggac aaaccaagac caaattggga 
ctcttcagtg aagccttctc cgatttcctc 
45 gtggaaaaag ggattttagc tgccaaggct 
acacgtaaaa aatctggttt ggaaatttcc 
tctaataacc ctgctgaaac agaactcttc 
gccaaatctg gtcgtaaccg tgagtttcag 
aacgttgaaa aagcaagtat ggataagatt 
50 acagccatgg gaacaggatt tggcgcagaa 
ctcgttttga tgaccgatgc cgatgtcgat 
ttgatttatc gttatatgaa accaatccta 
ccaatctatg gtgtcaaggt tggaagcgag 
caagaaatca aactccaaga agctttagoc 
55 attcagcgtt ataaggggct aggtgaaatg ■ 
gatcccgaac atcgcttgat ggctagagtt 
atctttgata tgttgatggg ggatcgagta 



tataatcaac caatttcaaa ccatatgact 480 
acggatcttg caacggctga gagtattctt 540 
gtcgatgaag aaggcagtct ttctggtttg 600 
gagtttccaa atgcggctaa agatgagttt 660 
gttacttcag atacatttga acgtgcagag 720 
gttattgata ctgcacatgg tcattctgca 780 
gctcatttcc cagatcggac tttgattgct 840 
gccctttatg aagcgggtgt agacgttgtt 900 
actactcgtg tgattgctgg tgttggtgtt 960 
gctgttgcgc gcgaatatgg taaaacgatt 1020 
gatattgtaa aagcacttgc tgcaggtgga 1080 
ggaactgatg aagctccagg cgaaactgaa 1140 
cgtggtatgg gatcaattgc tgctatgaag 1200 
tctgtcaatg aagcaaacaa gcttgttcca 1260 
ggagcggcag ctgatattgt tttccaaatg 1320 
tgtggtgcag ctaaccttaa agaactacac 1380 
gctggtttga aagaaagcca tcctcatgat 1440 
tctatgtaa 1479 



caggattatg atgccagtca aattcaagtt 60 
ccagggatgt acattggatc aacctcaaaa 120 
gttgataact caattgacga ggccttggca 180 
gagccagatg attcgattac tgttgtggat 24 0 
gaaaaaacag gccgtcctgc tgttgagacc 300 
ttcggcggtg gtggatacaa ggtttcaggt 360 
aatgcccttt ccactcaatt agacgttcat 420 
gaataccgtc gtggtcatgt tgtcgcagat 480 
ggaacaactg ttcacttcac accggaccca 54 0 
tttgataaat taaataaacg gattcaagag 600 
tcaattacag ataagcgcca aggtttggaa 660 
attgctagtt acgttgaata tatcaacgag 720 
tatacagacg gtgagatgga tgatatcaca 780 
taccatgaaa atgtcatgag tttcgccaat 840 
gaacaaggtt tccgtacagc cttgacacgt 900 
ttactgaaag acaatgaaga caacctaaca 960 
gttatctcag ttaaacaccc aaatccacag 1020 
aatagcgaag tggtcaagat taccaatcgc 1080 
atggaaaatc cacagattgc caaacgtatc 1140 
cgtgtggctg ccaagcgtgc gcgtgaagtc 1200 
aaccttccag ggaaactagc agactgttct 12 60 
atcgtcgaag gagactcagc tggtggatca 1320 
gctatccttc caattcgcgg taagattttg 1380 
ctagctaacg aagaaattcg tagtcttttc 1440 
tttgatgttt cgaaagcccg ttaccaaaaa 1500 
ggagcccaca ttcgtaccct tcttttaacc 1560 
gaagctggtt atgtttatat tgcccaacca 1620 
attaaagaat atatccagcc gggtgcagat 1680 
cgttatagtg aaggtcgtac caaaccgact 1740 
gacgatcatc agctgtggga aacaaccatg 1800 
tctgtagatg atgctgcaga agcagataaa 1860 
gagcctcgtc gtgagtttat cgaagaaaat 1920 
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gctgtctata gtacacttga tgtctaa 194 

<210> 61 

<211> 267 

5 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 61 

atgggattta ctgaagaaac agtacgtttt aaattggacg attccaataa aaaagaaatt 60 
10' agcgaaactt tgacagatgt ttatgcttcg ttgaacgata agggttacaa cccaattaac 120 
caaatcgtag gttacgtatt gagtggagac cctgcctacg ttcctcgtta taataatgca 180 
cgaaatcaaa tccgtaagta tgagcgtgat gaaatcgttg aggaattggt tcgctactac 240 
ctcaaaggac aaggagtcga tctataa 267 

15 <210> 62 
<211> 597 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 62. 

atggtcaact atccacataa agtttcatca caaaaaagac aaacatctct ttctcaaccc 60 
aaaaatttcg caaatcgagg aatgtctttt gaaaagatga tcaatgctac caacgactac 120 
tatttgtctc agggcttggc tgttatacat aagaaaccaa ctcctattca aatcgtacaa 180 
gtggactatc cacaacgaag tcgtgccaag attgttgaag cctattttcg acaagcttca 240 
acgacggact attctggcgt ttataatgga tattacatcg actttgaagt- caaggaaaca 300 
aaacaaaaac gtgcgattcc gatgaaaaat tttcatccac atcagattca gcatatggaa 360 
caagtccttg cccaacaagg aatctgcttt gtccttcttc acttttcttc tcagcaagaa 420 
acctacttat tgccggcatt cgatttgatt cgcttctatc atcaagataa gggacaaaaa 480 
tcaatgccac ttgaatatat tcgagaatat ggatatgaaa tcaaggctgg tgccttccct 540 
caaattcctt atctcaatgt tatcaaagaa catttattag gtggtaaaac aagatga 597 

<210> 63 

<211> 867 

<212> DNA 

35 <213> Streptococcus pneumoniae 

<400> 63 

atggctctat ttagtaaaaa agataagtat attcgaatca atcccaatcg ttcggttagg 60 
gaaaaacctc aagctaagcc agaggttcca gatgaattat tttcccagtg tccaggctgt 120 
aagcatacca tctatcagaa ggatctggga agtgaacgta tctgtccgca ctgtagctat 180 
acctttcgta tttctgccca agaacgcttg. gctttgacga ttgatatggg aaccttcaaa 240 
gaattgttta cagggattga aagcaaggat cccttgcatt tccctggtta ccaaaagaaa 300 
ctggcatcta tgcgtgaaaa aacaggtctg catgaagccg ttgtgacagg aactgctctt 360 
attaaaggtc agactgtggc tcttgggatt atggattcta actttatcat ggcttctatg 420 
ggtacggttg taggtgaaaa aatcactcgt ttgtttgagt atgcgactgt cgaaaaattg 480 
ccagttgtcc tattcacagc ctctggtgga gcccgtatgc aggaaggaat catgagtctc 540 
atgcagatgg ptaagatctc tgcggcggtt aaacgccatt caaatgctgg tctcttttac 600 
ctgaccattt tgacagatcc aacgactggt ggtgtgacag cttctttcgc tatggaaggc 660 
gatatcattc tggctgaacc acagagcttg gttggttttg ctggacgtcg tgtgattgaa 720 
aatacggttc gtgaaagctt gcctgaggat ttccaaaagg cagaattcct attagaacat 780 
ggctttgtgg atgctattgt caaaagaaga gacttaccag atacgattgc tagcctagtc 840 
agattgcatg gagggagtcc tagatga gg7 

<210> 64 
55 <211> 420 
<212> DNA 

<213> Streptococcus pneumoniae 
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<400> 64 

atgagaatta tgggattgga cgtcggttca aaaacggtag gggtggcgat tagcgatccg 60 
cttggtttta cagctcaagg gcttgaaatc atccagataa atgaagaaca aggccaattt 120 
ggttctgacc gcgttaagga attggttgat acttacaagg tggaacgatt tgtagtgggc 180 
ttgcctaaaa acatgaacaa tacaagtgga ccgcgcgtag aagctagtca agcatacgga 240 
gcaaagctag aagagttttt tggtttacca gtagactatc aggatgaacg cttgacaaca 300 
gtggctgctg agcgcatgtt gattgaacaa gcagatatca gtcgcaataa gcgcaagaaa 360 
gtcattgata agttagcagc tcagctgatt ttacaaaatt atttagatag aaaattttaa 420 

<210> 65 
<211> 1197 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 65 

atggcaaaac ttactgttaa agacgttgac ttgaaaggta aaaaagtcct" cgttcgtgtt 60 
gacttcaacg taccattgaa agatggcgta atcactaacg ataaccgtat cacagcagct 120 
cttccaacta ttaagtacat catcgaacaa ggtggacgtg caattctttt ctctcacctt 180 
ggacgtgtga aagaagaagc tgataaagct ggtaaatcac ttgctcctgt agcagcagac 240 
ttggcagcaa aacttggtca agatgttgtt' ttcccaggtg tcactcgtgg tgctgaattg 300 
gaagcggcaa tcaacgctct tgaagatgga caagttctct tggttgaaaa cactcgttac 360 
gaagatgttg acggcaagaa agaatctaaa aacgatcctg aacttggtaa atactgggca 420 
tcacttggag atggtatctt cgtaaacgat gcattcggta cagctcaccg tgcacacgca 480 
tctaacgttg gtatctcagc aaacgttgaa aaagcagttg ctggtttcct tcttgaaaac 540 
gaaattgcct acatccaaga agcagttgaa actccagaac gtccattcgt ggctatcctt 600 
ggtggttcaa aagtttcaga caagatcggt gttatcgaaa acttgcttga aaaagctgat 660 
aaagtcctta tcggtggtgg gatgacttac acattctaca aagcacaagg tatcgaaatc 720 
ggtaactcac ttgtagaaga agacaaattg gatgttgcga. aagctcttct tgaaaaagca 780 
aatggtaaat tgatcttgcc agttgactca aaagaagcta acgcatttgc tggttacact 840 
gaagtgcgtg acactgaagg tgaagcagtt tctgaaggct tccttggtct tgacatcggt 900 
ccaaaatcta tcgccaaatt tgacgaagct ttgactggtg ccaaaacagt tgtatggaac 960 
ggacctatgg gtgtatttga aaacccagat ttccaagctg gtacaatcgg tgtgatggac 1020 
gctatcgtga aacaaccagg agttaaatca atcatcggtg gtggtgactc agctgccgca 1080 
gcgattaacc ttggccgtgc agacaagttc tcatggatta gtacgggtgg tggagcatca 1140 
atggaacttc ttgaaggtaa ggttcttcca caacttgcag ccttgacaga aaaataa 1197 

<210> 66 
<211> 498 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 66 

atgttaaaat cagaaaaaca atcacgttat caaatgttaa atgaagaatt gtccttccta 60 
ttggaaggcg aaaccaatgt tttggctaat ctttccaacg ccagtgctct cataaaatca 120 
cgttttccta ataccgtatt tgcaggcttt tatttgttcg atggaaagga attggtttta 180 
ggccccttcc aaggaggtgt ttcctgcatc cgtattgcac taggcaaggg tgtttgtggt 240 
gaggcagctc actttcagga aactgttatt gttggagatg tgacgaccta tctcaactat 300 
atttcttgtg atagtctagc taaaagtgaa attgtggtgc cgatgatgaa gaatggtcag 360 
ttacttggag ttctggatct ggattcttca gagattgagg attacgatgc tatggatcga 420 
gattatttgg aacaatttgt cgctattttg cttgaaaaga cagcatggga ctttacgatg 480 
tttgaggaaa aatcttaa 49Q 

<210> 67 
<211> 630 
<212> DNA 

<213> Streptococcus pneumoniae 
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<400> 67 

atgacaatcg aactattgac tccctttacc aaggtagagt tggagccaga aatcaaggag 60 
aaaaaacgca^aacaagttgg gattttaggg gggaatttta accctgttca caatgcccat 120 
ctcattgttg cggatcaagt acggcaacag ttgggactgg atcaagttct gctcatgcct 180 
gaataccaac ctcctcacgt tgataaaaag gaaaccatcc ctgaacacca tcgtctcaag 24 0 
atgcttgagt tggcaattga gggaattgac ggcctagtca ttgaaaccat tgagttggag 300 
cgcaagggta tttcctacac ctacgacacc atgaagattt tgacagagaa gaatccagat 360 
acggattatt actttatcat cggtgccgac atggttgact atctgcctaa gtggtaccga 420 
attgatgaac tggttgacat ggttcagttt gtgggggttc agcgtccacg ctacaaggta 



480 



gggacttcct atccagttat ctgggtggac gtaccgctca tggatatctc gtccagcatg 54 0 
gtgcgtgcct tccttgccca a.ggtcggaaa cccaactttc tcctacctca gccagtgcta 600 
gactacatcg agaaggaggg gctctactga - 63Q 

15 <210> 68 
<211> 768 
<212> DNA 

<213> Streptococcus pneumoniae 
20 <400> 68 

atgaatattg caaaaatagt cagagaagcg cgtgagcaga gtcgcttgac aaccttggac 60 
tttgcgacag gcatttttga tgaatttatc caattacatg gtgaccgttc ttttcgtgat 120 
gatggtgcag ttgttggtgg tattggttgg cttggagacc aagctgtaac agtggttggt 180 
atccaaaaag gcaagagttt gcaagacaac ctcaaacgga attttggcca accacatcca 240 
O gaaggctacc gaaaggcact gcggttgatg aaacaggctg agaaatttgg ccgtccagtt 300 
gtgaccttta tcaatacagc aggtgcttat cctggtgtcg gagcggaaga acgtggtcaa 360 
ggggaagcta ttgctcgcaa tctcatggaa atgagtgacc tgaaagttcc tattatcgcc 420 
attattatcg gtgaaggtgg ttcaggcggg gctctggctc tagctgtcgc ggaccgtgtc 480 
tggatgctgg aaaattctat ctatgccatt ctcagtccag aaggctttgc ttccatttta 54 0 
tggaaggacg gtactcgcgc catggaagca gcagaactga tgaaaatcac ttcgcatgaa 600 
ctgttagaaa tggacgtggt ggataaggtg atttctgaag taggactttc tagtaaagaa 660 
ctgattaaga gtgtcaaaaa agaactccaa acggagctgg ctagactttc acaaaaaccg 720 
ctagaagagt tgctggaaga acgctatcaa cgatttagaa aatactaa 768 

35 <210> 69 
<211> 510 
<212> DNA 

<213> Streptococcus pneumoniae 
40 <400> 69 

atgattataa aagtagaaat ggcagatgtt gaggtgttgg ctaaaattgc caaacaaacc 60 
tttcgtgaaa cctttgcgta tgataatacg gaagagcagt tacaggaata ctttgaagag 120 
gcttatagtc tgaaaacttt gtcaactgag ttgggaaatc ctgactctga aacctatttc 180 
attatgcatg aggaggagat agctggtttt ctcaaagtca actggggaag tgctcaaact 240 
« gagagagaat tagaggacgc ttttgaaatt caacgcctct atgtgctaca aaaattccaa 300 
ggatttggac taggtaagca actgtttgaa ttcgcacttg aacttgctac aaaaaatagt 360 
ttttcttggg cttggctagg tgtttgggag cataatacaa aagctcaagc cttttataat 420 
cgatatggtt ttgaaaaatt tagccaacat cattttatgg ttggtcaaaa agtagatacg 4 80 
gattggttac tgagaaagaa attaaggtaa • 5 10 



<210> 70 
<211> 1590 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 70 

atgttacggg ggactgcttt gctaacggct agtaacttta. tcagtcgcct actcggggct 60 



27 



WO 01/49721 



PCT/US00/35604 



gtttacatta tcccttggta catctggatg ggggcttatg cagctaaggc aaatggtctc 120 
tttaccatgg gttacactat ctatgcttgg ttcttgttgg tttcaacagc ggggattcca 180 
gttgcggtgg ccaagcaagt tgccaagtat aataccatgc gagaagaaga gcatagcttt 240 
gccctgattc ggagcttctt aggctttatg acaggactag gcctggtttt tgctttagtc 300 
ttgtatgtct ttgctccttg gctagcagac ttgtctggcg tgggcaaaga cttgatccca 360 
atcatgcaaa gcttggcttg gggagtcttg attttcccgt ctatgagtgt tatccgagga 420 
tttttccaag ggatgaataa cctcaaaccc tatgccatga gccaaattgc tgagcaggtc 480 
attcgtgtta tctggatgct cctagcaacc tttatcatta tgaagctcgg ttcaggagat 540 
tatctagcag ccgttaccca atcaaccttt gctgcctttg .tcggtatggt agccagtttt 600 
gcagtcttga tttatttcct tgcccaagaa agttcactca aaagagtctt tgaaacagga 660 
gataagatta acagtaagcg tctcttggtt gataccatta aggaagccat tccttttatc 720 
ctgacagggt ctgccatcca gatcttccag attttggatc agctgacctt tatcaatagt 780 
atgagctggt ttaccaacta cagcaatgag gacttggttg tcatgttttc tta.tttctca 840 
gccaatccta ataaaatcac gatgattttg atttctgtag gggtttcgat tgggagtgtt 900 
ggtttgccac ttttgacgga aaactatgtc aagggggact tgaaagcggc ttctcgtctc 960 
gttcaggaca gtctcaccct actctttatg ttcttgctac cagcaacggt tggagtggtt 1020 
atggtaggag aacctcttta tacggtcttc tatggtaagc cagatagttt ggctctgggc 1080 
ttatttgtct ttgcagtttt gcagtctatt attttaggct tgtacatggt cttgtctcca 1140 
atgcttcagg ccatgttccg caaccgcaag gccgttctct attttatcta tggttctatt 1200 
gccaagctag tcttgcaact acctaccatc gccctcttcc acagttatgg tcctttgatt 1260 
tcaacaacca ttgctctcat cattcctaac gtcttgatgt atcgggatat ttgtaaagta 1320 
actggtgtca agcgcaaggt gattttgaag cgaaccattt taatcagttt gctgacccta 1380 
gtcatgtttc tgttaatagg aaccatccag tggctgttag gatttttctt ccaaccaagt 14 40 
ggacgtttgt ggagcttctt ttatgtagct cttgtcggtg ccatgggggg tggactttat 1500 
atggttatga gtctgcgtac ctatttatta gataaggtaa taggaaaagc ccaagcagat 1560 
•cgcctgcgag caaaatttaa gctttcgtaa 1590 

<210> 71 
<211> 468 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 71 

atgtcagata agattggctt attcacaggc tcatttgatc cgatgacaaa tgggcatctg 60 
gatatcattg aacgggcgag cagactcttt gataagctct atgtcggtat tttttttaat 120 
ccccacaaac aaggatttct tcctatcgaa aatcgtaaac gggggctaga aaaggctttg 180 
ggacatctgg aaaatgttga agtcgtggct tctcatgatg aattggtggt cgatgttgca 240 
aaaagattgg gtgctacttg tctagtgcgt ggtttgagga atgcgtcgga tttgcaatat 300 
gaagccagtt ttgattacta caatcatcag ctgtcttctg atatagagac tatttattta 360 
catagtcgac ctgaacatct ctatatcagt tcatcaggcg ttagagagct tttgaagttt 420 
ggtcaggata ttgcctgcta tgttcccgag agtatttgga ggaaataa 4 68 

<2i0> 72 
<211> 432 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 72 

atgacgattt tgtttgtggt tatcagtgct tcctttctgt atatggtttc tcttagcatg 60 
aaaccctatc aaacagctaa aagtgaagga gaaaaattag ctcagcagta tgcaggatta 120 
gagcaggccg atcaggttga tttatacaat ggcttggaat cttattacag cgttcttggt 180 
cgtaataaac agcaagaagc acttgctgtt ctgattggaa aagatgatca taagatttac 240 
gtttatcagc taaatcaggg tgtttcacaa gaaaaagcag aaacggtttc taaggaaaag 300 
ggagctggcg aaattgacaa gattatcttt ggtcgttatc aagataagcc aatctgggaa 360 
gtcaagtcag gatctgattt ttatctagta gattttgaaa caggagcatt ggtcaacaag 420 
gagggcctat ga 4 32 
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<210> 73 
<211> 732 
<212> DNA 

<213> Streptococcus pneumoniae 
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'<400> 73 

atgattgata 

gaaagtaagg 

tctcaccgtc 

caggttcggg 

atttactaca 

aatagtcgtt 

gccttgaata 

gatgttcttg 

caaataaata 

aaaaaaagag 

cataatgtgg 

tacggagaag 

caactaattt 



ttcattcgca 
ctctcttgac 
gcaagggcat 
aaatagctaa 
cgccagatgt 
atgccttgat 
aaatattgat 
aaaataatga 
gttcacatgt 
cgcagtattt 
acggcagacc 
cgaaggctca 
ag 



cattgtcttt 
agaagcctac 
gtttgaaact 
ggaagtcgcg 
tttggataag 
agagtttagt 
gttgggaatt 
aaaacgcgtt 
cctcaaatcc 
cttggagcgt 
cccccatatg 
ggaacttttt 



gatgtagatg 
aggcaggggg 
ccagaagaga 
agtgacttgg 
ctggaaaaca 
atgaacactc 
actcccgtca 
cgagagctga 
aaactttttg 
gatttggttc 
gcagaagcat 
atagacaatc 



atggtcccaa 
tgcgaaccat 
agatagcaga 
tcattgctta 
atcggattcc 
cttatcgcga 
ttgcccacat 
tcgatatggg 
gagaacctta 
atatcattgc 
atgaccttgt 
ctcgaaaaat 



gtcaagagag 60 
tgtctctacc 120 
aaactttctt 180 
tggggctgaa 240 
gaccctcaat 300 
tattcatagt 360 
agagcgctat 420 
ctgttacacg 480 
taaattcatg 540 
aagtgatatg 600 
ttcccaaaaa 660 
tgtaatggat 720 
732 



<210> 74 
<211> 927 
.<212> DNA 

<213> Streptococcus pneumoniae 



<400> 74 

atgtctacaa 

actattgatg 

aaatcaactg 

ccaatgtttt 

tatatcaagc 

gacggtgtct 

aacttggtta 

gtcatgatgg 

ttgacttact 

tggattggta 

aacttggcgt 

cctcaattgg 

tacgaagcct 

cgtttgattt 

ttgacaaccc 

gctagtgatc 



tcgataaaga 
cgccagcata 
tagtcatgtt 
ctaagtttga 
caaatgcgga 
ggttcggagc 
tcggtgtttt 
aagtttacaa 
caatcggagc 
ttgccttcat 
cacgtacttt 
tatctgttat 
tcttgtcttt 
cggattattc 
ttgtcttggt 
cacgtacaca 



aaaatttcag 
ttcttactgg 
gggaatcttg 
tttcaatgat 
gcattggttc 
tcgtaactcc 
tgtcggtggt 
cgtcatctca 
tggattctgg 
gatccgtgtg 
gggaacacca 
tgtgacaacc 
cttcggtctt 
acaaaacgta 
atccttgtcc 
tagatag 



tttgtaaaac 
aaatcagtgt 
gtagccatca 
gtcagcaagg 
ggtactgaca 
atcctcattt 
atttggggta 
aacatcccac 
aatctgattt 
caaatcttgc 
accttgaaga 
atgactcaaa 
ggattaccga 
acaaccaatg 
cttttcgtag 



gtgacgattt 
ttaaacaatt 
ttttgataag 
taaacgactt 
gtaacggtaa 
ctgtgattgc 
tttcaaaatc 
ctcttttgat 
ttgccatgag 
gctatcgtga 
ttgttgccaa 
tgcttccaag 
ttacagtgcc 
cttacttgtt 
ttggtcaaaa 



tgcctctgaa 60 
tatgaagaaa 120 
tttcatctac 180 
tagtgttcgt 240 
atcgctcttt 3.00 
gacagtgatt 360 
agttgaccgt 420 
tgttattgtc 480 
cgtaacaaca 540 
cttggaatac 600 
aaatatcatg .660 
ctttatctca 720 
aagtttgggt 780 
ctggattcca 840 
cttagcggat 900 
927 



<210> 75 
<211> 234 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 75 

atgtataacc tattattaac cattttatta gtattatctg ttgtgattgt gattgcaatt 60 
ttcatgcaac caaccaaaaa ccaatccagc aatgtatttg atgccagttc aggtgatttg 120 
tttgaacgca gtaaagctcg cggttttgaa gctgtaatgc agcgtttgac agggatttta 180 
gtctttttct ggctagccat tgccttagca ttgacggtat tatcaagtag ataa 234 

<210> 76 
<211> 1110 
<212> DNA 
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<213> Streptococcus pneumoniae 
<400> 76 

atgtttcgta gaaataaatt atttttttgg accacagaaa ttttactctt aaccatcatc 60 
ttttacctat ggagacagat gggatctttg attaaccctt ttgttagcgt gcttaataca 120 
attatgattc catttttatt agggggcttt ctttattatt tgacaaaccc tattgttact 180 
ttcttaaata aagtctgtaa actcaatcgt ttgcttggta ttttaattac cttgtgtact 240 
ttggtctggg gaatggtcat aggtgttgtc tatctcttac ctattttgat taatcagtta 300 
tctagtttga ttatatctag tcaaactatt tatagtcgag tacaagactt aatcatagac 360 
ttatctaatt atcctgcgct ccagaatttg gatgtagaag ctacaattca gcagttaaac 420 
ttatcctatg ttgatattct tcaaaatatc ctaaatagcg tatcaaatag tgtggggagc 480 
gtcttgtcag ctcttatcag tactgttttg attttgatta tgactccagt ttttttggtt 54 0 
tatttcttat tagatggaca taaattcttg cccatgcttg aaagaacgat tctaaagagg 600 
gatcgcttgc atattgcagg cttattaaag aatttaaatg cgacgattgc tcgctatatt 660 
agtggagttt cgattgacgc aatcattata ggttgtttgg cttatattgg ctatagtatt 720 
attggtttaa aatatgcttt agtttttgcc attttttctg gtgtagccaa tttaattcct 780 
tatgtggggc caagtattgg tttgattcct atgatcatcg caaatatatt cactgtaccc 840 
catagactgc tgattgcagt gatttatatg cttgttgttc agcaggtaga tggcaatatc 900 
ttatatcctc gaattgtagg aagtgttatg aaggttcatc caatcacgat tttagtttta 960 
cttttgttgt caagcaatat ctatggtgta gttggaatga ttgtcgcagt gccaacctat 1020 
tctatcttga aagaaatttc taagttctta tcccgtttgt atgaaaatca taaaataatg 1080 
aaagaacgag aaagagaatt agctaagtaa 1110 

<210> 77 
<211> 1356 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 77 

atgtatcaag cactttatcg aaaatataga agtcaaaact tctcccagtt agttggtcaa 60 
gaagttgtgg ctaagactct taaacaagcg gtggagcaag agaaaataag tcacgcttat 120 
cttttttctg gtcctcgtgg aacgggaaaa accagtgttg ctaaaatctt tgccaaggct 180 
atgaactgtc ccaatcaagt gggtggcgaa ccttgcaata actgctatat ttgtcaagca 240 
gtgacggacg gtagtttaga agatgtcatt gaaatggatg cagcttctaa taatggggta 300 
gatgaaattc gcgaaattcg tgataaatct acctatgcgc ctagccttgc tcgttataag 360 
gtttatatca tagatgaggt tcacatgctg tctacagggg cttttaatgc cctcctaaag 420 
acgctggaag aaccaacaca gaatgtagtc tttattttgg ccactactga attgcacaag 4 80 
attcctgcta ctattctatc ccgtgtgcaa cgttttgagt ttaaatcaat taagacacag 540 
gatattaagg aacatattca ctatatctta gaaaaagaaa atatcagttc tgaaccagag 600 
gctgtggaaa tcattgccag acgggcggaa ggtggaatgc gggacgcctt gtctattttg 660 
gatcaagccc tgagtttgac acagggaaat gagctgacga ctgctatctc tgaagaaatt 720 
actggcacca ttagcctatc agccttggat gattatgtgg cggccttgtc tcaacaggat 780 
gttcccaaag. ctttgtcttg cttgaatctt ctttttgaca atggtaagag catgactcgt 840 
tttgtgaccg atcttttgca ctatttaaga gacttgttaa ttgttcaaac agggggagaa 900 
aatactcatc atagttcagt ctttgtagaa aatttggcac ttcctcaaaa aaatctgttt 960 
gaaatgattc gcttagcaac agtgaattta gcagatatta agtctagttt gcagcccaag 1020 
atttatgctg aaatgatgac cgtccgtttg gcggaaatca agcccgaacc agctctatca 1080 
ggagcggttg aaaatgaaat tgctacgctg agacaggaag ttgcccgtct caaacaagag 1140 
ctttctaatg caggtgcggt tcctaaacaa gttgcaccag ctcctagtcg accagctacg 1200 
ggcaaaacag tctatcgtgt cgatcgcaat aaagtgcaat ctatcttaca agaggccgtc 1260 
gaaaatcctg atttaacacg tcaaaatcta attcgtttgc agaatgcatg gggagaggta 1320 
attgaaagtc taggtgggcc ggacaagctc tgctag 1356 

<210> 78 
<211> 1989 
<212> DNA 

<213> Streptococcus pneumoniae 
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<400> 78 

atgtttcgat taaccaataa gttagcggta tcgaacttga ttaaaaaccg caaactctac 60 
tatccttttg cgctggctgt tctcttggca gtcactctca cctatctctt ttactctcta 120 
accttcaatc ctaagattgc ggaaatccgt ggaggaacaa ccattcaggc tacacttgga 180 
tttggtatgt ttgtcgtcac ccttgcgtca gccattatcg ttctctatgc caatagtttt 24 0 
gtcatgaaga aacgttccaa ggaactagga atttatggca tgttgggctt ggagaagcgt 300 
catcttatca gtatgacctt taaggagtta gtggtatttg ggattctaac tgttggagcg 360 
ggtatcggta ttggagcctt gtttgacaag ttaattttcg ctttcctgct caaactaatg 4'20 
aaattgaagg ttgagctggt tgctaccttc cagacgaaag ttgtcattac agtgcttgtt 480 
gtcttcggtt tgattttcct aggcctcatg ttcctgaatg cccttcgaat cgcccgtatg 540 
aatgccctcc agctctctcg tgagaaagct agtggagaga aaaaaggtcg cttccttcct 600 
ctccaaacca ttcttggttc cataagttta ggaattggct attatcttgc ccttacggta 660 
aaagatcctc ttacagcctt aacaaccttc ttcatagctg ttttactggt tatctttggg 720 
acttatctct tgtttaatgc agggattacc gttttcctcc aaatcttaaa gaaaaataag 780 
aaatactatt accaaccaaa taacctcata tctgtttcta acttgatttt ccgtatgaag 840 
aaaaatgcag ttggactagc aactatcgct attttgtcaa caatggtttt ggtaaccatg 900 
tcagcagcga caagcatttt caattccgca gaatccttta aaaaagttct aaatcctcat 960 
gattttgggg tttcaggaca aaatgttgaa aaagaagatt tggacaaact cttgagccag 1020 
tttgcaagtg acaatggtta taagattaaa gaaaaagaag tgtttcgtta cacttacttt 1080 
ggtgttgcga accaagaagg aaataagtta accttttttg aaaaaggaca aaatcgtgtc 1140 
caacccacaa cagttttcat ggtatttgac caaaaagatt atgaaaatat gactggtcaa 1200 
aaactgtctc tatcaggaaa tgaggtcggt ctctttgcca aaaatgacgg actgaaagga 12 60 
cagaaaactc taattctgaa tgatcatcaa ttttctgtaa aagaagaatt taataaagat 1320 
tttattgtca accatgtccc aaatcagttt aatattttga ctgctgatta caattacctt 1380 
gttgtacctg atttacaagc ctttttgaac caattcccag attcggatat ctataatcag 14 40 
ttttacggtg gtatgaatgt aaatgtcagt gaagaagaac aactcaaggt cgctgaggag 1500 
tatgaaaact acctcaatca atttaatgct caattagaca cagaaggtag ctatgtttat 1560 
ggtagcaatc tagcagatgc tagttctcag atgagtgccc tctttggtgg tgtcttcttt 1620 
atcggtattt tcctatccat tatctttatg gtcggaactg ttctggtcat ctactacaaa 1680 
caaatttctg aaggctacga agaccgtgaa cgctttatta tcttgcagaa agtcggtttg 17 40 
gaccaaaagc aaatcaagca aaccatccac aaacaggttt taactgtttt cttccttcct 1800 
ttgctctttg ccttcataca tctcgccttt gcctaccata tgcttagcct gattttaaaa 18 60 
gtgattggtg tactggatac gactatgatg ttgattgtga ccttgtctat ctgcgctatc 1920 
ttcctcatcg cctatgtgct gattttcatg attacttcaa gaagttatcg caagattgtq 1980 
caaatgtaa 9 * 1989 

<210> 79 
<211> 891 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 79 

atgaaacaag atcaactaaa ggcttggcaa ccagctcagt ttgaccgttt tgtccgtatc 60 
ttagaacaag accagctcaa tcacgcctat ctcttttcag gtttctttgg aagcttggaa 120 
atggcgcaat ttttagctaa gagcctcttt tgtacggata aagttggcgt cttaccatgt 180 
gagaaatgcc gaagttgcaa gctgattgaa caggaagagt ttccagatgt caccttgatt 24 0* 
aagccagtca atcaggtcat caagacagaa cgcattcggg aattggtggg acagttttct 300 
caagcaggga ttgaaagcca gcaacaggtc tttattatcg agcaagcgga taaaatgcat 360 
cccaacgcag ccaattctct gctcaaggtc atcgaagaac cccagagtga agtttatatt 420 
ttcttcttga ctagcgatga ggaaaagatg ttaccgacaa tccgaagtcg gactcagatc 480 
ttccacttta aaaagcaaga agaaaaactt atcttactct tagaacaaat gggacttgtt 540 
aagaaaaaag cgactctttt agctaagttt agtcaatcgc gagctgaagc agaaaagttg 600 
gctaatcagg caagtttttg gaccttggtc gatgaaagtg aacgcctgct gacttggtta 660 
gtagctaaga aaaaagaaag ttatctacag gttgccaaat tagccaactt ggcagatgat 720 
aaggaaaaac aggatcaggt tttacggatt cttgaagttc tctgtgggca ggacctcttg 780 
caggtaagag taagagtgat tctacaagat ttactagaag ctagaaaaat gtggcaagct 840 
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aatgtcagct ttcaaaatgc catggaatat ctggtcttga aagaaatata a 891 

<210> 80 
<211> 615 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 80 

atgaattcat ttaaaaattt cttaaaagag tggggactgt tcctcctaat tctgtcatta 60 
ctagctttaa gtcgta.tctt tttttggagc aatgttcgcg tagaaggaca ttccatggat 120 
ccgaccctag cggatggcga aattctcttc gttgtaaaac accttcctat tgaccgtttt 180 
gatatcgtgg tggcccatga ggaagatggc aataaggaca tcgtcaagcg cgtqattgga 240 
atgcctggcg acaccattcg ttacgaaaat gataaactct acatcaatga caalgaaacg 300 
gacgagcctt atctagcaga ctatatcaaa cgcttcaagg atgacaaact ccaaagcact 360 
tactcaggca agggctttga aggaaataaa ggaactttct ttagaagtat cgctcaaaaa 420 
gcccaagcct tcacagttga tgtcaactac aacaccaact ttagctttac tgttccagaa 480 
ggagaatacc ttctcctcgg agatgaccgc ttggtttcga gcgacagccg ccacgtaggt 540 
accttcaaag caaaagatat cacaggggaa gctaaattcc gcttctggcc aatcacccat 600 
atcggaacat tttaa y 

<210> 81 
<211> 987 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 81 



60 



atggtagtat ttacaggttc aactgttgaa gaagcaatcc agaaaggatt gaaagaatta ou 
gatattccaa gaatgaaggc tcatatcaaa gtcatttcta gggagaaaaa aggctttctt 120 
ggtctatttg gtaaaaaacc agcccaagtg gatattgaag cgattagtga aacgactgtt 180 
gtcaaagcaa atcaacaggt agtaaaaggc gttccgaaaa aaatcaatga tttgaacgag 24 0 
cctgtgaaga cggttagtga agaaaccgtt gaccttggtc atgtggttaa tgctattaaa 



300 



aaaatagagg aagaaggtca aggtatttct gatgaagtca aggctgaaat cttaaaacat 360 
gaaagacatg ccagcactat cttagaagaa actggtcaca ttgagatttt aaatgaactt 420 
caaatcgagg aagcgatgag ggaagaagca ggcgctgatg accttgaaac tgagcaagat 480 
caaactgaaa atcaagactt gaaagagatg ggcttgaagg tcgagcaaag ttatgatatt 54 0 
gcccaggtgg ctacggatgt gactgcctat gttcaagcga ttgtggatga catggatgtt 600 
gaagctacac tttcaaatga ttataaccgt cgtagcatca atctacaaat tgacaccaac 660 
gaaccaggtc gtattatcgg ctaccatggt aaagtcttga aggccttgca actgttggct 720 
caaaattatc tttacaaccg ctattccaaa accttctacg ttacaatcaa tgtcaatgat 7'80 
tatgtcgaac accgtgcaga agtcttgcag acctatgcgc aaaaattggc gaatcgtgtt 84 0 
ttggaagaag gtcgcagtca taaaacagat ccaatgtcaa atagcgaacg caagattatc 900 
catcgtatta tttcacgtat ggatggcgtg actagttact ctgaaggtga tgagccaaat 960 
cgctatgttg ttgtagatac agaataa 987 

45 <210> 82 

<211> 1383 
<212> DNA 

<213> Streptococcus pneumoniae 



60 



50 <400> 82 

atgtcaaatt ttgccattat tttagcagcg ggtaaaggga ctcgcatgaa atctgatttg ou 

ccaaaagttt tgcacaaggt tgcgggtatt tctatgttgg aacatgtttt ccgtagtgtg 120 

ggagctatcc aacctgaaaa gacagtaaca gttgtaggac acaaggcaga attggttgag 180 

gaggtcttgg ctggacagac agaatttgtg actcaatctg aacagttggg aactggtcat 240 

^ gcagttatga tgacagaacc tatcttagaa ggtgtgtcag gacacacctt ggtcattgca 300 

ggagatactc ctttaatcac tggtgaaagc ttgaaaaact tgattgattt ccatatcaat 360 

cataaaaatg tggccactat cttgactgct gaaacggata atccttttgg ctatggacga 420 
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attgttcgta atgacaatgc tgaggttctt cggtcattgf tgagcagaag gatgctacag 480 
attttgaaaa gcaaatcaag gaaatcaaca ctggtaacat acgtctttga caacgagcgt 540 
ttgtttgagg ctttgaaaaa tatcaatacc aataacgctc aaggcgaata ctatattaca 600 
gacgtcattg gtattttccg tgaaactggt gaaaaagttg gcgcttatac tttgaaagat 660 
tttgatgaaa gtcttggggt aaatgaccgt gtggcgcttg cgacagctga gtcagttatg 720 
cgtcgtcgca tcaatcataa acacatggtc aacggtgtta gctttgtcaa tccagaagca 780 
acttatatcg atattgatgt tgagattgct ccggaagttc aaatcgaagc caatgttatc 840 
ttgaaagggc aaacgaaaat tggtgctgag actgttttga caaacggtac ttatgtagtg 900 
gacagcacta tcggagcagg agcggtcatt accaattcta tgattgagga aagtagtgtt 960 
gcagacggtg tgacagtcgg tccttatgct cacattcgtc caaa.ttcaag tctgggtgcc 1020 
caagttcata ttggtaactt tgttgaggtg aaaggatctt caatcggtga gaataccaag 1080 
gctggtcatt tgacttatat cggaaactgt gaagtgggaa gcaacgttaa tttcggtgct 1140 
ggaactatta cagtcaacta tgacggcaaa aacaaataca agacagtcat tggagtcaat 1200 
gtctttgttg gttcaaattc aaccattatt gcaccagtag aacttggtga caattccctc 12 60 
gttggtgctg gttcaactat tactaaagac gtgccagcag atgctattgc tattggtcgc 1320 
.ggtcgtcaga tcaataaaga cgaatatgca acacgtcttc ctcatcatcc taagaaccag 1380 



<210> 83 
<211> 936 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 83 

atgtccaaga t.tctagtatt tggtcaccaa aatccagact cagatgccat cggatcatct 60 
gtagcttttg cctaccttgc aaaagaagct tacggtttgg atacggaagc tgttgccctt 120 
ggaactccaa atgaagaaac agcctttgtc ttgaactatt ttggtgtgga agcaccaqgt 180 
gttatcactt ctgccaaagc agagggggca gagcaagtta tcttgactga ccacaatgaa 240 
ttccaacaat ctgtatcaga tatcgctgaa gtagaagttt acggtgttgt agaccaccac 300 
cgtgtggcta actttgaaac tgcaagccca ctttacatgc gtttggagcc agttggatca 360 
gcgtcttcaa tcgtttaccg tatgttcaaa gaacatggtg tagctgttcc taaagagatt 420 
gcaggtttga tgctttcagg tttgatttca gatacccttc ttttgaaatc accaacaaca 480 
cacccaacag ataaaatcat tgctcctgaa ttggctgaat tggctggtgt aaacttggaa 540 
gaatatggtt tggcaatgtt gaaagctggt accaacttgg ctagcaaatc tgctgaagaa 600 
ttgattgaca tcgatgctaa gacttttgaa ctcaacggaa ataatgtccg tgttgcccaa 660 
gtgaacacag ttgacatcgc tgaagttttg gaacgccaag cagaaattga agctgcaatg 720 
caagctgcca acgaatcaaa cggctactct gactttgtct tgatgattac agatatcgtc 780- 
aactcaaact cagaaatatt ggctcttggt gccaatatgg acaaggtcga agcggctttc 840 
aatttcaaac ttgaaaacaa tcatgccttc cttgctggtg ccgtttcacg taagaaacaa 900 
gtggtacctc aattaactga aagctttaat acgtaa 935 

<210> 84 
<211> 678 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 84 

atgatttcaa agagattaga attggtagct tcctttgtgt cacagggggc tattttacta 60 
gatgtgggaa gtgaccatgc ttatctgcct atcgagttgg ttgagagagg ccaaatcaaa 120 
agcgctattg caggtgaggt ggtggaaggt ccctatcagt ctgcggttaa aaatgttgag 180 
gctcacggcc taaaggagaa aatccaagtc cgtttagcca atggcttggc agcttttgaa 240 
gagactgacc aagtgtctgt cattaccatt gctggcatgg gtggtcgttt gattgctagg 300 
attttagaag aaggtttggg gaagttagct aatgtagagc gtttgatcct ccagcccaat 360 
aatcgtgaag acgacttgcg tatctggcta caggatcatg gattccagat tgtagcagaa 420 
agcatcttag aagaagctgg aaagttttat gagattttgg tggtggaagc aggacaaatg 480 
aagctatcag ccagtgatgt tcgctttggt cccttcttgt ccaaagaagt cagtccagta 540 
tttgtccaaa aatggcaaaa agaagctgag aagctagagt tcgccctcgg acaaatccca 600 
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gaaaaaaatc tggaagaacg tcaagttcta gtagataaga ttcaagctat caaqqaqqtq 660 
ctccatgtta gcaagtga 67Q 



<210> 85 
<211> 486 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 85 

atgaatttaa acgatattaa agacttgatg actcaatttg accagtcaag tttgagagaa 60 
ttttcttata aaaatgggac ggatgagttg cagtttagca agaatgaagc gagacctgtg 120 
cctgaagttg caactcaagt cgctccagca cccgttctag caacaccgag tccagtagct 180 
cctacatctg ctccagcaga gactgtagca gaagaagttc cagctccagc tgaagcaagt 240 
gtggctagtg agggaaatct tgtagagagt ccacttgttg gagtggttta cttggctgct 



ID ggaccagata aacctgcctt cgttacagtt ggtgatagtg tcaaaaaagg tcaaacattg 360 
gtaattatcg aagccatgaa agtcatgaat gaaatcccag ctcctaagga tggtgtggta 420 
acggaaattc tcgtctctaa cgaagaaatg gttgagtttg gtaaaggatt ggtacgtatc 4 80 
aaatga 486 

20 <210> 86* 
<211> 1236 
<212> DNA 

<213> Streptococcus pneumoniae 
25 <400> 86 

atgaaactaa atcgagtagt ggtaacaggt tatggagtaa catctccaat cggaaataca 60 
ccagaagaat tttggaatag tttagcaact gggaaaatcg gcattggtgg cattacaaaa 120 
tttgatcata gtgactttga tgtgcataat gcggcagaaa tccaagattt tccgttcgat 180 
aaatactttg taaaaaaaga taccaaccgt tttgataact attctttata tgccttgtat 240 
gcagcccaag aggctgtaaa tcatgccaat cttgatgtag aggctcttaa tagggatcgt 300 
tttggtgtta tcgttgcatc tggtattggt ggaatcaagg aaattgaaga tcaggtactt 360 
cgccttcatg aaaaaggacc caaacgtgtc aaaccaatga ctcttccaaa agctttacca 420 
aatatggctt ctgggaatgt agccatgcgt tttggtgcaa acggtgtttg taaatctatc 480 
aatactgcct gctcttcatc aaatgatgcg attggggatg ccttccgctc" cattaagttt 540 
S3 ggtttccaag atgtgatgtt ggtgggagga acagaagctt ctatcacacc ttttgccatc 600 
gctggtttcc aagccttaac agctctctct actacagagg atccaactcg tgcttcgatc 660 
ccatttgata aggatcgcaa tgggtttgtt atgggtgaag gttcagggat gttggttcta 720 
gaaagtcttg aacacgctga aaaacgtgga gctactatcc tggctgaagt ggttggttac 780 
ggaaatactt gtgatgccta ccacatgact tctccacatc cagaaggtca gggagctatc 840 
\) -aaggccatea aactagcctt ggaagaagct gagatttctc cagagcaagt agcctatgtc 900 
aatgctcacg gaacgtcaac tcctgccaat gaaaaaggag aaagtggtgc tatcgtagct 960 
gttcttggta aggaagtacc tgtatcatca accaagtctt ttacaggaca tttgctgggg 1020 
gctgcgggtg cagtagaagc tatcgtcacc atcgaagcta tgcgtcataa ctttgtacca 1080 
atgacagctg ggacaagtga agtatcagat tatatcgaag ctaatgtcgt ttatggacaa 1140 
> ggcttggaga aagaaattcc atacgctatt tcaaatactt ttggttttgg aggccacaat 1200 
gcagttcttg ctttcaaacg ttgggagaat agataa 1236 



<210> 87 
<211> 1080 
) <212> DNA 

<213> Streptococcus pneumoniae 



<400> 87 

atgaacatct atgatcaact acaagttgta 
ctgagtgacc ctgatgtcgt ttcagacacc 
gcttccaatc gtgacaccgt aatagcctac 
gtcgatgccg aagagatgat taaggaatca 



gaagaccgtt atgaagaatt aggagaattg 60 
aagcgtttta tggagctttc aaaagaagaa 120 
cgtgagtata aacaagtcct tcaaaatatc 180 
ggcggagatg cggacttgga agaattggcc 24 0 
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aagcaagaac tcaaagatgc caaggctgaa aaagaagaat atgaagaaaa actgaaaatt 300 
ttgctccttc caaaggatcc aaacgatgac aagaatatca tccttgaaat ccgtggagca 360 
gctggtggag acgaagcggc acttttcgct ggagatttgc taactatgta ccaaaagtat 420 
gcggaagccc aaggttggcg ctttgaagtc atggaagcct ctatgaatgg tgtcggtggt 480 
5 tttaaagaag tggttgctat ggtttcaggt cagtctgtat actctaagct taagtatgaa 540 
tcaggtgccc accgtgtgca acgtgttcct gtgacagaaa gccaaggccg tgttcatact 600 
tcgacagcga cagttcttgt tatgccagaa gttgaagagg ttgaatacga cattgatcca 660 
aaagaccttc gtgtcgacat ctatcacgcc tctggtgctg gtggacagaa cgtcaataag 720 
gttgcgactg ccgttcgtat cgttcaottg ccaaccaata tcaaggttga gatgcaggaa 780 
iu gaacgtaccc agcagaagaa ccgcgagaag gctatgaaga ttatccgtgc acgcgtcgct 840 

900 
960 

. - - - => 3-- - - auuyyciugaa 1020 

gttgtggatg ccttggtgct ttatgaccaa acacaaaaac tagaagaatt aaacaaataa 1080 

<210> 88 
<211> 1680 
<212> DNA 
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55 



gaccactttg ctcagattgc tcaggatgaa caagacgctg agcgtaagtc gacaatcggt 
actggt.gacc gttcagaacg gatccgaact tataacttcc cacaaaaccg tgtcacagac 
caccgtatcg gcttgaccct ccaaaaacta gatacgattt tgtctggtaa attggacgaa 



<213> Streptococcus pneumoniae 
<400> 88 



atggcctaca ctcttaaacc tgaagaagtt ggtgtttttg ccatcggtgg tctaggagaa 60 
atcgggaaaa acacttacgg aattgaatac caagacgaga ttatcatcgt cgatgctggg 120 
attaaattcc cagaagatga cttgcttggt atcgactatg tcattcctga ctactcttac 180 
atcgtagaca atatcgaccg cgtcaaggct gttttaatca cacacggaca cgaggaccac 24 0 
attggtggga ttccgttcct actcaagcaa gcaaatgtcc ctatttatgc tggaccgctt 300 
gccttggctt tgatccgtgg gaaactcgaa gaacacggcc tcttgcgcaa cgccaaactt 360 
tacgaaatca accacaacac cgagttgacc tttaaaaatc tcaaggcaac tttctttaga 420 
acgactcact ctattccaga gcctttgggg attgtcattc atactcctca agggaaaatc 480 
gtctgtacgg gtgactttaa gttcgacttt actccagttg gagaacctgc ggacttgcat 540 
cgtatggctg cgcttggtga agaaggcgtg ctctgtctcc tgtctgactc gacaaatgcg 600 
gaagtaccaa cctttaccaa ctctgaaaaa gtcgttggtc agtccattat gaagattatc 660 
caaggtattg aaggacgtat catctttgca tcctttgcct caaatatctt ccgtctccag 720 
caggcaacag aagctgctgt taagactgga cgcaagattg cggtctttgg tcgttctatg 780 
« gaaaaggcca ttgtcaacgg aatcgatctt ggctacatca aagctcctaa gggaaccttt 840 
atcgagccaa atgaaatcaa agattatcct gcaggagaag ttcttatcct ctgtacaggt 900 
agtcagggtg agcctatggc agccctctct cgtatcgcca acggaaccca ccgtcaagta 960 
caattacaac caggtgatac cgttatcttc tcttctagtc ccatccctgg aaacactact 1020 
agcgtcaaca agctgattaa catcatttct gaagctggtg tcgaagttat ccacggtaaa 1080 
gtgaacaata tccatacatc tggacacggt ggtcagcaag agcaaaaact catgctctgc 1140 
ttgattaagc caaaatactt catgcctgtc cacggtgaat accgcatgca aaaagtccac 1200 
gctggactag cagtggatac tggtgttgag aaggacaata tctttatcat gagcaatggc 1260 
gatgtgcttg cccttactgc tgactcagct cgtatcgcag gtcatttcaa cgcccaagat 1320 
atctatgtcg atggaaatcg tatcggtgaa attggcgcag ctgtcctcaa agatcgtcgc 1380 
gatctatctg aagacggtgt cgttctggca gtcgcaactg ttgacttcaa atcgcagatg 1440 
attctgtctg gcccagatat cctcagccga ggctttgtct acatgagaga gtctggagac 1500 
ttgattcgcc aaagccagcg tatcctcttc aatgccattc gtatcgcact gaaaaataag 1560 
gatgctagcg tgcaatctgt caatggtgcc -attgtcaacg ctattcgccc cttcctctat 1.620 
gaaaataccg aacgtgaacc gatcatcatc ccgatgatcc tcacaccaga tgaagaataa 1680 



<210> 89 
<211> 1362 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 89 

atggcagaag tagaagagtt acgagtacaa cctcaagata tcttagctga gcaatccgtt 
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ttaggggcta tctttattga tgagagtaaa cttgtttttg tgcgagaata cattgagtct 120 
cgggactttt ttaagtatgc ccatcgtttg attttccaag ccatggtcga tttatccgat 180 
cgtggtgatg ccatagatgc aacaacggtt cgtactatcc ttgataatca aggtgattta 24 0 
cagaatattg gtggcttgtc ttacttggtt gagattgtta attctgtgcc aacttctgct 300 
5 aatgcggagt attatgctaa gattgttgca gaaaaagcaa tgctacgtcg tttaattgcc 360 
aagttgacag agtctgtcaa ccaagcttac gaagcgtcac aaccagctga tgaaattatt 4 20 
gctcaggcag aaaaagggtt gattgatgtc agtgaaaatg caaatcgaag cgggtttaag 480 
aacattcgag atgtgttgaa tctcaacttt ggaaatctgg aagctcgctc gcaacaaacg 540 
accgatatta caggtattgc gacaggttat cgtgatttgg atcatatgac aacaggactt 600 

10 catgaggagg agttgattat cttagcagct cgtccagcag ttggtaagac agcatttgcc 660 
ttgaatatcg ctcagaacat tgggactaag ttggacaaaa cggttgctat tttttcactc 720 
gaaatgggtg cggaaagctt ggtagatcgt atgttagctg cagaaggctt ggtggagtca 780 
cattctatcc gtacagggca attgacagat gaggagtggc aaaaatatac tattgctcag 840 
ggtaatctag ctaacgccag tatctatatc gatgatacgc caggtattcg gattacagag ,900 

15 attcgttctc gttctcgtaa attggctcaa gaaactggaa atcttggttt gattgtgata 960 
gactatttgc aacttatcac gggaactggt cgagaaaatc gtcaacaaga agtttctgaa 1020 
atttctcgtc agttgaaaat actagccaag gaattgaagg ttccagtaat cgctctgagt 1080 
cagctttctc gtggtgtaga acaacgtcag gacaacjagac cggtcttgtc tgatattcgt 1140 
gaatctgggt ctattgagca ggacgctgat atcgtagctt ttctctatcg cgatgactac 1200 

20 tatgaacgtg gtggtgaaga agaggagggt atcccaaata ataaggtgga agttattatc 1260 
gagaaaaacc gtagtggagc tcgtggaaca gtggaattga ttgtccaaaa agaatacaat 1320 
aaattcttca agtattcaa.g tatctcaaag agggaggcat aa 1362 

<210> 90 
25 <211> 693 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 90 - 

30 atggcgtata aatatttagt gattgtagaa tcacctgcaa aagccaagac aattgagaaa 60 
tatcttggac gaaactataa agtaatggcc agtgttggcc atataagaga tttaccgaaa 120 
agtaaaatgg gtatcgattt tgaaaacaat tatgaacccc attatatttc tatacgcgga 180 
aaaggcgatg tcatcaaaag cctgaaagcc gcagcaaaaa aagctcaaaa agtttacttg 240 
gcaagtgacc cggatagaga aggagaagcg attgcttggc atttagcgta cctacttggg 300 

35 ttggatctga aagaaaaaaa tcgggtggtc ttcaatgaaa tcacaaaaga cgcagtcaaa ■ 360 
gcagctttta aggaaccaag aacgatcgat gtagatttag tagatgcaca gcaagctcgt 420 
cgtaccttag acagaatcgt tggttattcg atcagtccta ttctctggcg taaggtcaag 480 
aaagggttaa gtgcaggacg tgtccaatct gtcgctttaa aaattattat tgaccgtgaa 540 
aaagagatcc gagaatttgt tccagaagaa tattggagca tcgacggtaa ttttaaaaaa 600 

40 gctcgcaaga aattcaaagc aaatttctgg ggaatcgacg gtaagaaaaa gaaattacca 660 
gatgcacaaa gtgtaaaaag aagtcactgc tag 693 

<210> 91 

<211> 981 

45 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 91 

atgtttattt ccatcagtgc tggaattgtg acatttttac taactttagt aggaattccg 60 
50 gcctttatcc aattttatag aaaggcgcaa attacaggcc agcagatgca tgaggatgtc 120 
aaacagcatc aggcaaaagc tgggactcct acaatgggag gtttggtttt cttgattact 180 
tctgttttgg ttgctttctt tttcgcccta tttagtagcc aattcagtaa taatgtggga 240 
atgattttgt tcatcttggt cttgtatggc ttggtcggat ttttagatga ctttctcaag 300 
gtctttcgta aaatcaatga ggggcttaat cctaagcaaa aattagctct tcagcttcta 360 
55 ggtggagtta tcttctatct tttctatgag cgcggtggcg atatgctttc tgtctttggt 420 
tatcaagtgc atctagggat tttctatatt gttttcgctc ttttctggc.t agtcggtttt 480 
tcaaacgcag taaacttgac agacggtgtt gacggtttag ctagtatttc cgttgtgatt 540 
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agtttgtctg cctatggagt tattgcctat gtgcaaggtc agatggatat tcttctagtg 600 
attctggcca tgattggtgg tttgctcagt ttcttcatct ttaaccataa gcctgctaag 660 
atctttatgg gtgatgtggg aagtttggct ttaggtggaa tgctggcagc tatctctatg 720 
gctctccacc aagaatggac tctcttgatt atcggaattg tgtatgtttt tgaaacaact 780 
tctgttatga tgcaagtcag ttatttcaaa ctgacaggtg gtaaacgtat tttccgtatg 840 
acgcctgtac atcaccattt tgagcttggg ggattgtctg gtaaaggaaa tccttggagc 900 
gagtggaagg ttgacttctt cttttgggga gttgggcttc tagcaagtct cctgacccta 960 
gcaattttat atttgatgta a 981 

10 <210> 92 

<211> 2082 
<212> DNA 

<213> Streptococcus pneumoniae 
15 <400> 92 

atggcacgcg aattttcact tgaaaaaact cgtaatatcg gtatcatggc tcacgtcgat 60 
gccggtaaaa caacaactac tgagcgtatt ctttactaca ctggtaaaat ccacaaaatc 120 
ggtgaaactc acgaaggtgc gtcacaaatg gactggatgg agcaagagca agaacgtggt 180 
atcacgatca catctgctgc gacgacagct caatggaaca accaccgcgt aaacatcatc 240 
gacacaccag gacacgtgga cttcacaatc gaagtacaac gttctcttcg tgtattggat 300 
ggtgcggtta ccgttcttga ctcacaatca ggtgttgagc ctcaaactga aacagtttgg 360 
cgtcaagcaa ctgagtacgg agttccacgt atcgtatttg ccaacaaaat ggacaaaatc 420 
ggtgctgact tcctttactc tgtaagcaca cttcacgatc gtcttcaagc aaatgcacac 480 
ccaatccaat tgccaatcgg ttctgaagat gacttccgtg gtatcattga cttgatcaag 540 
atgaaagctg aaatctatac taacgacctt ggtacggata tccttgaaga agacatccca 600 
gctgaatacc ttgaccaagc tcaagaatac cgtgaaaaat tgattgaagc agttgctgaa 660 
actgacgaag aattgatgat gaaatacctc gaaggtgaag aaatcactaa cgaagaattg 720 
aaagctggta tccgtaaagc gactatcaac gttgaattct tcccagtatt gtgtggttca 780 
gccttcaaaa acaaaggtgt tcaattgatg cttgatgcgg ttatcgacta ccttccaagc 840 
ccacttgaca tcccagcaat caaaggtatt aacccagata cagacgctga agaaattcgt 900 
ccagcatctg acgaagagcc atttgcagct cttgccttca agatcatgac tgacccattc 960 
gtaggtcgtt tgacattctt ccgtgtttac tcaggtgttc ttcaatcagg ttcatacgta 1020 
ttgaatactt ctaaaggtaa acgtgaacgt atcggacgta tccttcaaat gcacgctaac 1080 
agccgtcaag aaatcgacac tgtttactca ggtgatatcg ctgctgccgt tggtttgaaa 1140 
gatactacaa ctggtgactc attgacagat gaaaaagcta aaatcatcct tgagtcaatc 1200 
aacgttccag aaccagttat ccaattgatg gttgagccaa aatctaaagc tgaccaagac 1260 
aagatgggta tcgcccttca aaaattggct gaagaagatc caacattccg cgttgaaaca 1320 
aacgttgaaa ctggtgaaac agttatctca ggtatgggtg aacttcacct tgacgtcctt 1380 
gttgatcgta tgcgtcgtga gttcaaagtt gaagcgaacg taggtgctcc tcaagtatct 144 0 
taccgtgaaa cattccgcgc ttctactcaa gcacgtggat tcttcaaacg tcagtctggt 1500 
ggtaaaggtc aattcggtga tgtatggatt gaatttactc caaacgaaga aggtaaagga 1560 
ttcgaattcg aaaacgcaat cgtcggfggt gtggttcctc gtgaatttat cccagcggtt 1620 
gaaaaaggtt tggtagaatc fatggctaac ggtgttcttg caggttaccc aatggttgac 1680 
gttaaagcta agctttatga tggttcatat cacgatgtcg actcatctga aactgccttc 1740 
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aagattgcgg cttcactttc ccttaaagaa gctgctaaat cagcacaacc agctatcctt 
gaaccaatga tgcttgtaac aatcactgtt ccagaagaaa accttggtga tgttatgggt 1860 
cacgtaactg ctcgtcgtgg acgtgtagat ggtatggaag cacacggtaa cagccaaatc 1920 
gttcgtgctt acgttccact tgctgaaatg ttcggttacg caacagttct tcgttctgca 1980 
tctcaaggac gtggtacatt catgatggta tttgaccact acgaagatgt acctaagtca 2040 
gtacaagaag aaattattaa gaaaaataaa ggtgaagact aa 2082 



<210> 93 
<211> 1227 
<212> DNA 
55 <213> Streptococcus pneumoniae 

<400> 93 
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atgccaaatt acatftattcc attttcaccg cctgatatca cagaagcaga aattgctgaa 60 
gtagcggata ccctgcgttc tggttggatc acaacaggtc ctaaaacaaa agaacSgag 120 
cgcogcttgt ctctttacac acagacacct aagactgttt gtctcaactc tgSa««c 180 
gctctggagt tgattttacg cgttttggaa gtgggacctg gtgatgaagt catcgSca 2^0 
TtZJT at3C9gC " C ^gtagtgtc attacgcacg tgggagcaac ccctgSatg 300 
gtggatatcc aagcagatac gtttgagatg gactatgacc tgcttgagca agctatcact 360 
lllltlr agg f gattat ceeagt-gag ctcgcaggga ttgtttgcga tt'a^accS 420 
" g "" aa9 fc ^ gg f gaa aaaacgtgac ttctttaccg cttcaagcaa gtggcaaaag 480 
gcctttaacc gtattgtcat tgtctctgat agtgcccacg ctttgggatc tacitataS 540 
ggacaacctt ctggttctat cgctgatttt acttccttct cattccatgc cgttaagaac 600 
tttacaacgg cagaaggtgg aagtgcgact tggaaagcca atccagtgat tgatgacgaa 660 
gagatgtaca aggaattcca aatcctttcc cttcacgggc aaactaagga tgctrtt?" 720 
aagatgcaac tggggtcatg ggaatacgat atcgttacac cagcctataa gtgcaacatg 780 
accgatatca tggcttcact tggtttggta caattggacc gctatccaag tttgttgca! 840 
cgccgtaagg acattgtgga ccgctatgat agtggttttg caggttctcg catccatcct 900 
ttggcacaca agactgaaac tgtcgaatct tcacgccacc tctacatcac ccgtgtagaa 960 
SSS^n ' agaagaacg ^gcctcatc atccaagaat tggctaaagc aggaattjca 1020 
agtaatgttc actacaaacc gcttcctcto ttgacagcct ataagaatct tggatttgat 1080 
atgacgaact atcctaaggc ctatgcctto tttgagaatg aaattaccct ccctcttcS 1U0 

izttit:: ss £ a a t 9 ga ctat atcattga9a ctttcaaaac a9tttctgaa 1200 



1227 



<210> 94 
<211> 978 
25 <212> DNA 

<213> Streptococcus pneumoniae 

<4 00> 94 



W ■ ! t9 T?f aC ct 9 attttt 9 gaacgataat attgcggccc aaaaaacgtc gcaagaattg 60 
a aaatt a tta a ESSSS Ca \ taCCttC "taagatgg aagagttgca ggatgaagtc 120 
gaaattttat tggatttttt ggctgaagac gagtcagtgc atgatgaact ggtagcgcag 180 
ttagccgaac ttgataagat- aatgaccagc tacgagatga ctttaetctt gtcagaacc"? 240 
tatgaccaca acaatgccat cttggaaatc catccaggtt ctggtggtac tgaggcgcao 300 
atat £ tgct tcgtatgtat actcgttatg gtL'tgctaa ag^SSSJ 3 0 
gtggaagtgt tggattacca agcaggtgat gaggctggta ttaagtcggt aactttatca 420 
tttgaagggc ctaatgccta tggtctcctc aagtcagaaa tgggtgttca ccgcttagtg 480 
cgaatctcac catttgactc tgccaaacgt cgccatacct ctttcacatc tg? a gaagtg 540 
atgccagaat tggatgatac tattgaagtg gaaatccgtg aagatgatat caagatggat 600 
accttccgtt caggtggtgc cggtggacaa aacgtcaata aagtttcaac aggtgtSgt 660 
ttaacccaca ttccaactgg aattgttgtc caatcaacag tggatcgtac ccagtatgga 720 
aatagagatc gtgccatgaa gatgttgcag gctaagctct atcaaatgga gcaagagaag 780 
aa9g £ gCgg a Mtagattc tctcaaaggt gagaaaaagg agattacttg gggaagccaa 840 
atccgttctt atgtcttcac gccttatact atggtaaaag atcaccgaac tagctttgag 
gttgctcagg tagataaggt tatggatggg gacctagatg gttttatcga tgcttatctc 
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aagtggcgaa ttagctaa *™ 
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<210> 95 
<211> 750 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 95 



atgttttata cttatttgcg tggattagtt gtattgctct tatggtccat caatggcaat 60 
gctcactatc ataatactga taaaattcct aatcaagatg aaaattatat tttagttgcg 120 
S^Xff* ******** tcctgtttat atggcctttg cgaccaagcc aaaacag??? 180 
atctttatgg caaaaaaaga actctttacc aaccgtatct ttggttggtg gattcgtata 240 
tgtggcgcct ttcccatcga ccgtgaaaat cccagcgcct cagcWcaa atatcSat? 300 
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aaca^ Ca aa3aaagtga «*J<*ctctc atcatgtttc caagtggtag ccgccactca 360 
• rt2£? aggggggcgc agcactgatt gccaaaatgg ccaaggtccg tatcatgccg 420 
gttacctaca ccggtcccat gactttgaag ggcttgatta gccgtgaacg tgtcgaiatg 480 
aactttggaa atccaatcga tatctcagat atcaagaaaa tgaatgatga aggcSgaa MO 
cZltZ'T atq9tattCa «c. g ..ttc caacgtctgg acgaagaaac gaaacaa^g 6 0 
actlttlt,* aaaaac ^ aaa tccactctgg tggtttatcc gcatccctgc cctcatcc?? 660 
gctattatcc tcgctatcct aaccatcatc tttagcttta tcgcaagctt catctggaac 720 
ccagataaga aaagagaaga acttgcatag 750 

10 <210> 96 

<211> 3102 
<212> DNA 

<213> Streptococcus pneumoniae 
15 <400> 96 

ttgatcgcac aactagatac aaaaacagtc tatagtttta tggaaagcgt catttcgatc 60 
gaaaagtatg tgagagcagc taaagaatac ggctacactc atttggctat gatggatatt 120 
gacaatcttt atggcgcttt cgactttcta gagattacaa aaaaatacgg cat^catcct 180 
20 i!?S a ? 9 ? C " gaaatgac ^gtttgta gatgatcagg gagtaaattt gcgcttttta 240 
20 gctctatcta gtgtgggcta tcagcagttg atgaagcttt cgacagccaa gatgcagggg 300 

tSaataLn ?? °£ gtCCCagtac ctggaggata tcgcggtcat t«tgUtKt 360 

tttgatagag ttgagtcgtt agaactaggc tgtgattact atataggggt ttatccagaa 420 
TactaaZT ^""J" t«tcctatc ttacctcttt atcgggtcaa cgcttttgaa 480 
25 JTJ? 9 t gagaagttct tcaagtttta acagcgatta aagaaaatct accgctcaga 540 
25 gaagttccct tgcgttcgag acaagatgtc tttatatcag caagttcttt agagaaac?a 600 
ttccaagagc gttttccgca agctttggac aatttagaaa agcttatttc aggcatttct 660 
tacgacttgg atactagtct gaaactgcct cgttttaatc cagctagacc agcagtagag 120 
• gaftgagag agcgtgctga actggggctt gttcagaagg ggttgactag taaa|aa?a? 780 
caagatagac tagaccaaga attgtctgtt attcatgata tgggctttga tgattatttc 840 
In™ If? gggatttgtt Wtttgg. cgatcgaatg gctattatat gggaajggga 900 
aggggttctg cagtaggcag tttggtttct tatgccttag acatcacggg gattgaSS 960 
attlTall* Tt 9Stttt ^acgcttt cttaatcgtg aacgctatac catgcctgat 1020 
tatt-attlt? ata \ CCcaga tatttatcgt ccagatttta tcagatatgt tggtaataaa 1080 
tatggtagta aacatgcggc acaaatcgtt actttttcaa cctttggagc caagcaagct 1140 
aaScaat? T^ 9 ccagagtatg aattaStgc aattactSg 2M 

Tallt-ftt " Cgtgacaa tcttaagtcg gcctatgagg gcaatctcca gtttcgtcag 1260 
caaatcaata gtaagttaga ataccaaaaa gcttttgaga ttgcttgcaa gatagagggc 1320 
aJcta aa ??n aa % CCtCt f P"tgcggct ggtgttgtaa ttagtgacca agat?taJcc 1380 
40 %tl t <*ctaaagta tggtgatgaa attccactga ctcagtatga tgctcatgga 1440 
40 gttgaggcta gcggactttt gaagatggac tttctgggac tacgaaattt glcctttgtc 1500 
cagaagatgc aagagttgct tgctgaaata gaaggtattc accttaaaat tgaagaaata 1560 
gatttggaag acaaagaaac gttagattta tttgcctctg gaaatacaaa aggtatcttt 1620 
n^r,l 9 t g ° aacc * ggtgc tattcgcttg ctcaaacgtg ttcaaccagt ctgttttgaa 1680 
gatgtcgtag caactacttc tctaaatcga ccgggtgcta gtgactatat caataatttt 1740 
?i^ Caa9aa ag f atgggca ggaagaagtg actgttctgg atccagtact ggaggatatt 1800 
ttggctccaa cctacggcat aatgctctat caggagcagg ttatgcaggt tgcccagcga 1860 
tttgccggat ttagtcttgg gaaagccgat attttgcgtc gggctatggg gaaaaaggat 1920 
gcctctgcca tgcatgagat gagggcttcc tttatteaag gttcattaga agctggtcat 1980 
50 IS 9 ? aaa aagcagagca ggtctttgat gttatggaga agtttgcagg ctatggtttt 2040 
50 ^caggtcac acgcctatgc ctactcagca ttggccttcc agttggctta ttttJaaaca 2100 
o2SS g CCatatt ' ta ^^to atg ttgaattctg ccaacagtga ttacttaata 2160 
gatgcacttg aagcaggttt tgaagtggcg cctctgtcca tcaacacgat tccctatcac 2220 
gataaaattg ccaacaaggc catctatcta ggtttgaaat ccattaaagg agtcagtaat 2280 
55 Ultl 9 T tCtggattat tgaacataga ccttattcta atattgaaga ttttatagct 2340 
t?caS^? a f attatct faacttcct ctgctagaac ctttggtaaa agttggtctt 2400 
ttcgattcat ttgaaaaaaa tcgtcaaaaa gtatttaata acttagctaa tctatttgaa 2460 
tttgtgaaag agttgggaag tttgtttgga gatgctattt atagttggca ggaatcggaa 2520 
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gattggacgg aacaagaaaa attttatatg gaacaagagc ttttagggat aggtgtcagc 2580 
aaacatccac tacaagctat tgcaagtaag gctatttacc cgattacccc aatcggaaat 2640 
ttgtcagaaa atagctatgc tattattttg gttgaagttc agaaaataaa agtgattcgt 2700 
accaaaaagg gtgaaaatat ggccttctta caggcagatg atagtaagaa aaaattggat 2760 
gtcactctct tttcagactt atatcgtcag gttggacagg aaataaaaga gggagccttc 2820 
tactatgtaa aaggaaaaat acaatcacgt gatggccgtc tgcaaatgat tgcacaagaa 2880 
ataagagaag cagttgctga acgcttttgg atacaggtga aaaatcatga atcggatcaa 2940 
gaaatttcac gtattttaga acaatttaaa ggcccaatcc cagtcatcat ccggtatgaa 3000 
gaggaacaga aaaccatcgt ttctccccat cattttgtag ctaaatccaa tgaattagag 3060 
gagaaattga atgaaatcgt tatgaaaacg attt.atcgct aa 3102 



<210> 97 
<211> 921 
<212> DNA 
15 <213> Streptococcus pneumoniae 



45 



<400> 97 

atgaccaacg aatttttaca ttttgaaaaa atcagccgcc agacttggca atctttacat 60 
cgaaagacaa cacctccttt gacagaagaa gaattggaat ctatcaagag ttttaatgac 120 
caaatcagtc tccaagacgt tacagatatc tatctcccct tggctcatct gattcagatt 180 
tacaagcgaa ctaaggaaga tttagccttt tcaaaaggaa ttttcctcca acgtgaaagt 240 
aaatctcaac cttttattat tggggtttct gggagtgttg ccgttggaaa atccacaacc 300 
agtcgcctac ttcaaatcct actgtcccgt acgtttacag atgctacggt tgagttggtt 360 
acaactgatg gttttctcta tcccaatcaa accttgattg agcagggaat tttaaatcgt 420 
aaaggatttc ctgaaagcta tgatatggaa gctcttctca acttcttgga ccgcatcaaa 480 
aatggacaag atgtagatat tcctgtctat tctcatgaag tttacgacat cgtacccaaa 54 0 
aagaaacaaa gtgtcaaagc tgctgatttt gtaatcgttg agggaattaa tgtctttcaa 600 
aatccacaaa acgatcgtct ctatatcact gacttctttg acttttccat ctatgtagat 660 
gctggagtgg atgatattga aagttggtat ctggaccgtt tcttgaaaat gctgagtcta 720 
gcccaaaacg accctgatag ctactattat cgttttactc agatgccgat tggggaagtg 780 
gagtcctttg cccatcaggt ctggaccagt atcaatctca caaatctgca aaattatatt 840 
gaaccaacca gaaatcgtgc agaagtgatt cttcataaaa gcaagaacca tgaaatcgat 900 
gaaatttact taaaaaagta a 921 

35 <210> 9'8 
<211> 741 
<212> DNA 

<213> Streptococcus pneumoniae 
40 <400> 98 

atggaaattt cattattaac agatgttggt cagaaacgaa caaataacca agactatgtc 60 
aaccactatg tcaatagagc tggacgtacc atgattattt tagctgatgg gatgggaggt 120 
catcgtgcag ggaatatcgc tagt'gaaatg gcggtcacag acctgggtgt agcttgggtt 180 
gatacccaga tcgatacagt caatgaagtg cgtgaatggt tcgcccatta cctagaaatt 240 
gaaaatcaaa agattcacca gcttggtcag gatgaagctt acagaggcat gggaactact 300 ' 
ttggaagtcc ttgctattat tgataatcag gctatctatg ctcatattgg tgattcgcgt 360 
atcggcttga ttcgtggaga agaataccat cagttgacga gcgatcattc cttggttaat • 420 
gaattgctca aggctggtca attgacacca gaagaggcag aagctcatcc gcaaaaaaat 480 
attatcaccc agtctattgg gcaaaaagat gaaattcagc ctgattttgg gacagttatc 540 
cttgagtcag gtgactatct cttgctcaat agtgacggct tgaccaacat gatttcaggc 600 
agtgagattc gtgatattgt aaccagtgat attcctttag cagataaaac ggagacactt 660 
gttcgttttg ctaacaatgc aggaggttta gacaacatta cggttgccct tgtttctata 720 
aacgaggagg atgaagaatg a 7a 1 



741 



55 <210> 99 
<211> 831 
<212> DNA 



40 



WO 01/49721 



PCT/USOO/35604 



10 



15 



<213> Streptococcus pneumoniae 
<400> 99 

gtgacgatac agatgaagaa tacaggtaaa cgaattgatc tgatagccaa tagaaaaccg 60 
cagagtcaaa gggtcttgta tgaattgcga gatcgtttga agagaaatca gtttatactc 120 
aatgatacca atccggatat tgtcatttcc attggcgggg atggtatgct cttgtcggcc 180 
tttcataagt acgaaaatca gcttgacaag gtccgcttta tcggtcttca tactggacat 240 
ttgggcttct atacagatta tcgtgatttt gagttggaca agctagtgac taatttgcag 300 
ctagatactg gggcaagggt ttcttaccct gttctgaatg tgaaggtctt tcttgaaaat 360 
ggtgaagtta agattttcag agcactcaac gaagccagca tccgcaggtc tgatcgaacc 420 
atggtggcag atattgtaat aaatggtgtt ccctttgaac gttttcgtgg agacgggcta 480 
acagtttcga caccgactgg tagtactgcc tataacaagt ctcttggcgg tgctgtttta 540 
caccctacca ttgaagcttt gcaattaacg gaaattgcca gccttaataa tcgtgtctat 600 
cgaacactgg gctcttccat tattgtgcct aagaaggata agattgaact tattccaaca 660 
agaaacgatt atcatactat ttcggttgac aatagcgttt attctttccg taatattgag 720 
cgtattgagt atcaaatcga ccatcataag attcactttg tcgcgactcc tagccatacc 780 
agtttctgga accgtg.ttaa ggatgccttt atcggtgagg tggatgaatg a 831 

<210> 100 
20 <211> 1623 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 100 

25 atgtcaaaag aaattaaatt ttcatcagat gcccgttcag ccatggttcg tggtgtcgat 60 
attcttgcag acactgttaa agtaaccttg ggaccaaaag gtcgcaatgt cgttcttgaa 120 
aagtcattcg gttcaccctt gattaccaat gacggtgtga ccattgccaa agaaatcgaa 180 
ttggaagacc attttgaaaa tatgggtgct aagttagtat cagaagtagc ttctaaaacc 240 
aatgatatcg caggtgacgg aactacgact gcaacagtct tgacccaagc tatcgtccgt 300 
gaaggaatca aaaacgtcac agcaggtgca aatccaatcg gtattcgtcg tgggattgaa 360 
acagcagttg ccgcagcagt tgaagctttg aaaaacaacg ccatccctgt tgccaataaa 420 
gaagctatcg ctcaagttgc agcagtatct tctcgttctg aaaaagttgg tgagtacatc 480 
tctgaagcaa tggaaaaagt tggcaaagac ggtgtcatca ccatcgaaga gtcacgtggt 54 0 
atggaaacag agcttgaagt cgtagaagga atgcagtttg accgtggtta cctttcacag 600 
tacatggtga cagatagcga aaaaatggtg gctgaccttg aaaatccgta cattttgatt 660 
acagacaaga aaatttccaa catccaagaa atcttgccac ttttggaaag cattctccaa 720 
agcaatcgtc cactcttgat tattgcggat gatgtggatg gtgaggctct tccaactctt 780 
gttttgaaca agattcgtgg aaccttcaac gtagtagcag tcaaggcacc tggttttggt 84 0 
gaccgtcgca aagccatgct tgaagatatc gccatcttaa caggcggaac agttatcaca 900 
gaagaccttg gtcttgagtt gaaagatgcg acaattgaag ctcttggtca agcagcgaga 960 
gtgaccgtgg acaaagatag cacggttatt gtagaaggtg caggaaatcc tgaagcgatt 1020 
tctcaccgtg ttgcggttat caagtctcaa atcgaaacta caacttctga atttgaccgt 1080 
gaaaaattgc aagaacgctt ggccaaattg tcaggtggtg tagcggttat taaggtcgga 1140 
gccgcaactg aaactgagtt gaaagaaatg aaactccgca ttgaagatgc cctcaacgct 1200 
45 aetcgtgcag ctgttgaaga aggtattgtt gcaggtggtg gaacagctct tgccaatgtg 1260 
attccagctg ttgctacctt ggaattgaca ggagatgaag caacaggacg taatattgtt 1320 
ctccgtgctt tggaagaacc cgttcgtcaa attgctcaca atgcaggatt tgaaggatct 1380 
atcgttatcg atcgtttgaa aaatgctgag cttggtatag gattcaacgc agcaactggc 1440 
gagtgggtta acatgattga tcaaggtatc attgatccag ttaaagtgag tcgttcagcc 1500 
ctacaaaatg cagcatctgt agccagcttg attttgacaa cagaagcagt cgtagccaat 15 60 
aaaccagaac cagtagcccc agctccagca atggatccaa gtatgatggg cgggatgatg 1620 
taa 1623 
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- <210> 101 
55 <211> 1446 
<212> DNA 

<213> Streptococcus pneumoniae 
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<400> 101 

atgattaaga ttgaaaccgt attagatatt ttaaagaaag atggcctttt tcgcgaaatt 60 
attgaccaag gtcattacca ctacaactac agcaaagtta tttttgatag catcagctac 120 
gacagccgaa aagtaacaga agacactctt ttttttgcaa aaggcgctgc ctttaaaaaa 180 
gaataccttc tttctgctat aacacaaggt ttagcttggt atgtagctga aaaggactac 240 
gaagtcgata tccctgtcat cattgtgaac gatataaaga aagccatgag tttgattgcc 300 
atggagttct atggtaatcc acaagagaaa ctcaaactcc ttgcctttac tggtactaag 360 
ggtaagacaa cagcaaccta tttcgcctat aacatcttat ctcaggggca tagacctgct 420 
atgttgtcga ccatgaacac aactcttgat ggcgagactt tctttaagtc agcgttgaca 480 
acccctgaga gtattgacct ctttgacatg atgaatcagg ctgtgctaaa tgaccgtacc 540 
cacctcatca tggaagtctc cagtcaagcc tatctagtcc atcgagtcta tggactgacc 600 
tttgatgtag gagtctttct taacatcact cctgaccata tcggcccgat tgaacaccct 660 
agctttgaag actatttcta ccacaagcgt ctcttgatgg aaaatagccg agcagtcatc 720 
attaacagtg acatggacca cttctcagtc ttgaaagaac aggttgaaga tcaagaccat 780 
gatttctatg gtagccaatt. tgataaccaa atcgagaatt ccaaagcctt tagcttttca 840 
gctacgggta aactcgctgg agattatgat atccaactca ttggcaactt caaccaagaa 900 
aatgcagttg ctgctggact tgcttgtctc cgtctcggag caagtcttga ggacatcaaa 960 
aaaggcatcg ctgcaacccg cgttcctggt cgtatggaag tcctcactca gaaaaatgga 1020 
gccaaggtct tcatcgacta tgcccacaat ggggatagtc tgaaaaaact catcaatgtg 1080 
gttgaaactc atcaaaccgg aaagattgct ctggttctgg gatcaacagg aaacaaggga 1140 
gaaagtcgtc gtaaggactt tggcctcctc ctcaatcaac accctgagat tcaagtcttt 1200 
ctgactgctg atgaccctaa ctatgaagac ccaatggcca ttgcagatga aattagtagc 1260 
tacatcaatc atcctgttga aaagattgcg gatcgccaag aagccatcaa ggcggcaatg 1320 
gctatcacaa atcacgaatt agatgcagtt attattgcgg gtaagggagc cgattgttac 1380 
caaatcatcc agggcaagaa agaatcctac ccaggagata cagccgtcgc agaaaattat 14 40 

ttataa iaac 

1446 

<210> 102 
<211> 1980 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 102 

atgatccaaa tcggcaagat ttttgccgga cgctatcgga ttgtcaaaca gattggtcga 60 
ggaggtatgg cggatgtcta cctagccaaa gacttaatct tagatgggga agaagtggca 120 
gtgaaggttc tgaggaccaa ctaccagacg gacccgatag ctgtagctcg ttttcagcgt 180 
gaagcgagag ctatggcaga tctagaccat cctcatatcg ttcggataac agatattggc 240 
gaggaagacg gtcaacagta cctagctatg gagtatgtgg ctggactgga cctcaaacgc 300 
tatatcaagg aacattatcc tctttctaat gaagaagcag tccgtatcat gggacaaatt 360 
ctcttggcta tgcgcttggc ccatactcga ggaattgttc acagggactt gaaacctcaa 420 
aatatcctct tgacaccaga tgggactgcc aaggtcacag actttgggat tgctgtagcc 480 
tttgcagaga caagtctgac ccagactaac tcgatgttgg gctcagttca ttacttgtca 54 0 
ccagagcagg cgcgtggttc gaaggcgact gtgcagagtg atatctatgc catggggatt 600 
attttctatg agatgctgac aggccatatc ccttatgacg gggatagcgc ggtgaccatt 660 
gccctccagc atttccagaa acccctgccg tccgttattg cagaaaatcc atctgtacct 720 
caggctttag aaaatgttat tatcaaggca actgctaaaa agttgaccaa tcgctaccgc 780 
tcggtttcag agatgtatgt ggacttgtct agtagcttgt cctacaatcg tagaaatgaa 840 
agtaagttaa tctttgatga aacgagcaag gcagatacca agaccttgcc gaaggtttct 900 
cagagtacct tgacatctat tcctaaggtt caagcgcaaa cagaacacaa atcaatcaaa 960 
aacccaagcc aggctgtgac agaggaaact taccaaccac aagcaccgaa aaaacataga 1020 
tttaagatgc gttacctgat tttgttggcc agccttgtat tggtggcagc ttctcttatt 1080 
tggatactat ccagaactcc tgcaaccatt gccattccag atgtggcagg tcagacagtt 114.0 
gcagaggcca aggcaacgct caaaaaagcc aattttgaga ttggtgagga gaagacagag 1200 
gctagtgaaa aggtggaaga agggcggatt atccgtacag atcctggcgc tggaactggt 1260 
cgaaaagaag gaacgaaaat caatttggtt gtctcatcag gcaagcaatc tttccaaatt 1320 
agtaattatg tcggtcggaa atcctctgat gtcattgcgg aattaaaaga gaaaaaagtt 1380 
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ccagataatt tgattaaaat tgaggaagaa gagtcgaatg agagtgaggc tggaacggtc 1440 
ctgaagcaaa gtctaccaga aggtacgacc tatgacttga gcaaggcaac tcaaattgtt 1500 
ttgacagtag ctaaaaaagc tacgacgatt caattaggga actatattgg acggaactct 1560 
acagaagtaa tctcagaact caagcagaag aaggttcctg agaatttgat taagatagag 1620 
gaagaagagt ccagcgaaag cgaaccagga acgattatga aacaaagtcc aggtgccgga 1680 
acgacttatg atgtgagtaa acctactcaa attgtcttga cagtagctaa aaaagttaca 1740 
agtgttgcca tgccgagtta cattggttct agcttggagt ttactaagaa caatttgatt 1800 
caaattgttg ggattaagga agctaatata gaagttgtag aagtgacgac agcgcctgca 1860 
ggtagtgcag aaggcatggt tgttgaacaa agtcctagag caggtgaaaa ggtagacctc 1920 
aataagacta gagtcaagat ttcaatctac aaacctaaaa caacttcagc tactccttaa 1980 



<210> 103 

<211> 1176 

<212> DNA 

15 <213> Streptococcus pneumoniae 



<400> 103 

atgaaacatt ttgatactat tgtcatcggt gggggacctg ctggtatgat ggctacgatt 60 
tccagtaact tttatggaca gaaaaccctc ctcatcgaaa aaaatcggaa acttggaaaa 120 
aaattagctg ggactggtgg gggacgttgc aatgtgacca acaatggtag cttagacaac 180 
ctgctagctg gaattcctgg aaacggacgc tttctttaca gtgttttctc ccagttcgat 240 
aatcatgaca tcatcaactt ttttacagaa aatggtgtta aacttaaggt cgaagaccac 300 
ggacgcgtct ttccagccag tgacaagtct cggactatta tcgaagcttt ggaaaagaaa 360 
atcactgaac taggtggtca agttgctact caaatagaaa tcgtttctgt taaaaaagta 420 
gatgaccagt ttgtccttaa gtcagcggat caaaccttca cttgtgagaa actcattgtc 4 80 
acaacaggtg gtaagtctta tccttcgact ggttcgactg gttttggtca cgagattgct 540 
cgccatttta agcataccat caccgatctt gaggctgctg aaagtccttt attaacagat 600 
tttccacata aagccttaca agggatttct ctggacgatg tgaccctaag ttatggtaag 660 
catgtcatca ctcatgattt actctttacc cactttggtt tgtcaggtcc tgctgcccta 720 
cgcatgtcta gctttgtcaa aggtggggag gttctctcac tcgatgtttt gcctcaactt 780 
tctgagaagg acttggttac atttctagaa gaaaatcggg aaaaatcctt gaaaaacgct 840 
ttaaaaacct tgttaccaga acgcttggcc gaattttttg tacaaggata tcctgaaaaa 900 
gtcaaacagc tgactgaaaa ggaacgagaa caacttgtcc agtccattaa agaacttaaa 960 
attcctgtaa ctggaaaaat gtcccttgca aagtcctttg ttaccaaggg tggagtcagt 1020 
35 ctcaaggaaa tcaatcctaa aacccttgaa agtaagctgg tacctggcct ccactttgca 1080 
ggcgaagtta tggatatcaa tgcccacacg ggtggcttta acatcacttc tgccctctgt 1140 
accggctggg tggcgggaag tctgcattat gattaa 1176 

<210> 104 
40 <211> 696 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 104 

atgctgaaat gggaagactt gcctgtggaa atgaaatcaa gcgaggttga gtcttactac 60 
cagcttgtct ctaaaaggaa gggttcgctg attttoaagc gttgcttgga ctgggttttg 120 
gccttggtgc ttacatgggt tctaacttct cccatctttc tcatcttgag catttggatc 180 
aagttggata gcaaggggcc agtgatttac aagcaagagc gtgtgaccca gtacaaccgt 240 
cggttcaaga tttggaagtt ccgtaccatg gtgacggatg cggataaaaa aggaagtctg 300 
gtgacttctg ctaacgatag ccgtattacc aaggttggaa atttcatccg acgtgtccgt 360 
ttggacgaac tgcctcagtt ggtcaatgtc cttaaaggtg agatgtcctt tgtcggtaca 420 
cgacctgaag tgccacgtta tacagagcag tatagccctg aaatgatggc aaccttgctc 480 
ttgcaagcag gaattacctc tccagccagc atcaactaca aggatgagga caccatcatc 540 
agtcaaatga cggagaaagg tctgtcagtt gatcaggcct atgtggagca tgttcttcct 600 
» gaaaagatgc gctataacct cgcctatctc cgagagttta gtttctttgg ggacatcaaa 660 
atcatgtttc aaaccgtgtt tgaggtacta aaataa ggg 
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<210> 105 
<211> 423 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 105 

atgactagtc cactattaga atctagacgc caactccgta aatgcgcttt tcaagctctc 60 
atgagccttg agttcggtac ggatgtcgaa actgcttgtc gtttcgccta tactcatgat 120 
cgtgaatata cggatgtaca acttccagcc tttttgatag acctcgtttc tggtgttcaa 180 
gctaaaaagg aagaactaga taagcaaatc actcagcatt taaaagcagg ttggaccatt 240 
gaacgcttaa cgctcgtgga gagaaacctc cttcgcttgg gagtctttga aatcacttca 300 
tttgacactc ctcagctggt tgctgttaat gaagctatcg agcttgcaaa ggacttctcc 360 
gatcaaaaat ctgcccgttt tatcaatgga ctgctcagcc agtttgtaac agaagaacaa 4 20 
t aa 

15 423 

<210> 106 

<211> 3540 

<212> DNA 
20 <213> Streptococcus pneumoniae 

<400> 106 

atgtatttaa aggaaatcga aattcagggg ttcaagtctt ttgctgataa gaccaaggtc 60 
gtttttgacc aaggtgtgac ggcagttgtt ggacccaatg gatctggaaa gtccaatatt 120 
acagaaagtc tgcgttgggc tttgggggag tctagtgtca agagtctccg tgggggcaag 180 
O atgccggatg tcatctttgc tggaacagaa agtcgcaaac cgctcaatta tgcttctgta 240 
gttgtgactc tggataatca tgacggattt atcaaggatg caggtcaaga aatcagggtg 300 
gaacgccata tctatcgtag tggagatagc gaatacaaga ttgacggcaa gaaagtccgt 360 
ctgcgtgatattcatgacct cttcttggat actggattgg gacgagattc cttctctatt 420 
™ f^ttcccaag ggaaggttga ggagattttt aattccaagc ctgaggaacg acgagctatt 4 80 
tttgaagaag ctgctggagt tttaaaatac aagactcgca gaaaagaaac cgagagtaaa 540 
ctgcagcaaa ctcaggataa tctggaccgc ttagaggaca ttatctacga gttggataat 600 
caaatcaagc ctcttgagaa gcaagctgag aatgcccgta agtttttaga cttggaagga 660 
caacgtaagg ctatttattt agacgttctg gttgctcaaa tcaaggaaaa taaggcagaa 720 
ctagagtcga cagaagaaga gttggctcag gttcaagaac tcttgatgag ttattaccaa 780 
aagcgtgaaa aattagaaga agaaaatcaa actcttaaaa agcaacgcca agatttacag 840 
gctgaaatgg ccaaagacca aggcagtttg atggacttga ctagtctgat tagtgattta 900 
gaaagaaaat tagccctatc gaaactggag tccgagcaag tggccctgaa tcaacaggag 960 
gcacaagctc gtttggctgc tttggaggat aagagaaatt cactcagcaa agaaaagtat 1020 



gataaagaaa gctctttagc tctgttagag ggaaatctag tccaaaataa tcaaaaactc 1080 
W aatcgtttag aagctgaatt gctggctttc tcagacgatc ctgatcagat gattgagctc 1140 
ttacgtgaac gctttgtagc tcttttacaa gaagaagcgg atgtctcaaa ccagttgacc 1200 
cgtattgaga atgagttgga aaatagtcgt cagctttctc aaaaacaagc agatcaacta 12 60 
gaaaagctga aagagcaatt agctacagct aaagagaagg ctagtcagca aaaagacgag 1320 
cttgaaactg ccaaggtgca ggttcagaaa ttattggctg actatcaagc tattgccaag 1380 
gagcaagagg agcagaaaac ttcctatcaa gctcaacaaa gtcaactctt tgaccgtctg 144 0 
gatagtctca aaaacaagca ggccagagct caaagtttgg aaaatatcct gagaaatcat 1500 
agtaactttt atgcaggtgt taagagtgtt ctccaagaaa aagatcgcct aggtgggatt 1560 
attggtgcag tcagtgagca tctgaccttt gatgtttatt atcaaactgc cctagagatt 1620 
gccttagggg caagtagcca gcatatcatc gtagaagatg aagagtcggc aaccaaagct 1680 
attgattt cc tcaaacgaaa cagagtcggt cgtgcaacct ttcttccttt gaccactatt 174 0 
aaggcgcgta cgatttctag tcagaaccaa gatgctatcg ctgtaagccc -aggtttcctt 1800 
gggatggcag atgagttggt gacttttgat actagactgg aagccatttt caagaacttg 1860 
ctagctacga cggctatctt tgataccgta gaacatgcgc gtgaagctgc tcgacaagtt 1920 
cgttatcagg ttcgtatggt gacattggat gggacagaat tacgcacggg tggttcctat 1980 
33- gcgggtggtg ccaatcgcca aaataacagt attttcatca agccagaact ggagcaatta 204 0 
caaaaagaaa ttgctgcaga tgaagcaagc ttgggttcag aagaagcggc tttgaagacc 2100 
ttgcaagacc agatggctgc attgacagaa aga-ttagaag ccatcaaatc tcaaggagag 2160 
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caggcacgta ttcaggagca aggcttgtcc ctcgcttatc agcaaactag tcagcaagtt 2220 
gaagaactgg aaactctttg gaaactccaa gaagaggaaa tagatcgtct ttctgaggga 2280 
gattggcaag cggataagga aaaatgtcaa gagagccttg ctactatcgc cagtgacaag 2340 
caaaatctgg aagctgagat tgaagaaatt aagtctaata aaaacgccat ccaagaacgc 2400 
tatcaaaatt tgcaggaaga ggtggcgcaa gctcgcttgc ttaagacaaa actgcaaggg 24 60 
caaaaacgtt atgaagtagc tgatattgag cgtttaggca aggaattgga caatcttaat 2520 
atcgaacaag aagaaattca gcgcatgctc caagaaaaag ttgacaatct tgagaaggtt 2580 
gatacagaat. tgctcagtca acaggcggaa gaatccaaaa ctcagaaaac aaatctccaa 2640 
caaggtttga ttcgcaagca gtttgagttg gatgatatag aaggtcaact ggatgatatt 2700 
gccagtcact tggatcaagc tcgccagcag aatgaggagt ggattcgcaa gcaaacacgt 2760 
gctgaagcca agaaagaaaa ggtcagcgag cgcttgcgcc atctacaaaa tcaattaaca 2820 
gaccagtacc agattagcta tactgaagca ctagaaaagg cacatgaatt ggaaaacctc 2880 
aatctggcag agcaagaggt gcaggattta gagaaggcta ttcgctcatt gggacctgtc 294 0 
aacttggaag ctattgacca gtacgaagaa gttcacaacc gtctggactt tctaaatagt 3000 
cagcgagatg atattttgtc agcgaaaaat ctgctccttg aaaccattac agagatgaat 3060 
gatgaggtca aggaacgctt taaatcaacc tttgaagcta ttcgtgagtc ctttaaagtg 3120 
acctt.caagc agatgtttgg cggaggtcag gcagacttga tattgactga gggcgacctt 3180 
ttaacagctg gtgtggagat ttctgttcaa cctccaggta agaaaatcca gtcgcttaac 3240 
ctcatgagtg gtggtgaaaa agctctatcg gctcttgcct tgcttttctc cattattcgt 3300 
gtcaagacca ttccttttgt catcttggat gaggtggaag ctgcgctgga tgaagccaaf 3360 
gttaaacgtt ttggggatta cctcaaccgc tttgacaagg acagccagtt tatcgtcgta 3420 
acccaccgta agggaaccat ggcagcggct gattccatct atggagtgac catgcaagaa 3480 
tcaggtgtct caaaaattgt ttcggttaag ttaaaagatt tagaaagtat tgaaggatga 3540 

25 <210> 107 
<211> 1344 
<212> DNA 

<213> Streptococcus pneumoniae 
30 <400> 107 

atgacaaaac gtgtaacgat tattgacgta aaagactatg ttggtcagga agtgacgatt 60 
ggcgcttggg ttgccaacaa atcaggaaaa ggaaaaatcg ctttcttaca attgcgtgat 120 
ggaacagcct tctttcaagg tgtggctttt aaaccaaact ttgtcgaaaa atttggtgaa 180 
gaagtgggac ttgagaagtt tgatgttatc aaacgcttga gccaagaaac gtctgtttat 240 
^ gtgacaggta ttgtcaaaga ggacgaacgt tctaaatttg gctatgagtt ggacatcaca 300 
gacatcgaag tgatcggtga atctcaagac tacccaatca caccaaaaga acacggaaca 



10 



15 



20 



360 

gactttttga tggataaccg tcacttgtgg ctacgctctc gtaagcaagt agctgtgttg 420 
caaatccgta acgctattat ctatgcaact tatgagttct ttgacaagaa cggttttatg 480 
aagtttgaca gcccaattct ttcaggaaat gcggcagaag attctacaga actctttgaa 540 
actgactact tcggaacgcc agcctacttg agccaatcag gtcagcttta cctagaagca 600 
ggggctatgg ctcttggtcg tgtctt'tgac tttggtccag ttttccgtgc tgaaaaatca 660 
aaaacacgcc gtcacttgac tgagttctgg atgatggatg ctgagt'actc atacttgaca 720 
catgatgagt cgcttgactt gcaagaagct tatgtgaaag ctcttctaca aggtgttctt 780 
gaccgcgcgc ctcaagcctt ggaaaccttg gaacgtgata cagaactctt gaaacgctac 840 
attgcagagc cattcaaacg tatcacttac gatcaagcca ttgacctctt gcaagagcat 900 
gaaaatgatg aagatgctga ctacgagcat cttgagcatg gtgatgactt tgggtcacca 960 
cacgaaactt ggatttcaaa ccactttggt gtgccaacat ttgtcatgaa ctatccagca 1020 
gccatcaagg ccttctacat gaaaccagtt cctggaaatc cagagcgcgt gctttgtgca 1080 
gacttgcttg ctccagaagg ctatggagaa attatcggtg ggtctatgcg tgaggaagat 114 0 
tacgatgccc ttgtcgctaa gatggatgaa cttggcatgg atcgtacaga atatgaattc 1200 
taccttgacc ttcgtaaata cggtacagtt ccacacggag gatttggtat cggtatcgaa 1260 
cgtatggtaa ccttcgcagc aggaacaaaa catatccgtg aagctattcc attcccacgt 1320 
atgttgcacc gtatcaaacc ataa 1344 

55 <210> 108 
<211> 927 
<212> DNA 
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<213> Streptococcus pneumoniae 
<400> 108 

atgtctgaaa aattagtaga aatcaaagat ttagaaattt ccttcggtga aggaagtaag 60 
aagtttgtcg cggttaaaaa tgctaacttc tttatcaaca agggagaaac tttctcgctt 120 
gtaggtgagt ccggtagtgg gaaaacaact attggtcgtg ctatcatcgg tctaaatgat 180 
acaagtaatg gagatatcat ttttgatggt caaaagatta atggtaagaa atcgcgtgaa 240 
caagctgcgg aattgattcg tcgaatccag atgattttcc aagaccctgc cgcaagtttg 300 
aatgaacgtg cgactgttga ttatattatt tctgaaggtc tttacaatca ccgtttattt 360 
aaggatgaag aagaacgtaa agagaaagtt caaaatatta tccgtgaagt aggtcttctt 420 
gctgagcact tgactcgtta ccctcatgaa ttctcaggcg gtcaacgtca acgtatcggt 480 
attgcccgtg ccttggtcat gcaaccagac tttgttattg cagatgagcc aatttcagcc 540 
ttggacgttt ctgtacgtgc ccaagtcttg aacttgctca aaaaattcca aaaagagctc 600 
ggcctgacct atctcttcat cgcccatgac ttgtcggttg ttcgctttat ttcagatcgt 660 
atcgcagtta tttacaaggg tgttattgta gaggttgcag aaacagaaga attgtttaac 720 
aatccaattc acccatatac tcaagccttg ctttcagcgg taccaatccc agatccaatc 780 
ttggaacgta agaaggtctt gaaggtttac gacccaagtc aacacgacta tgagactgat 84 0 
aagccgtcta tggtagaaat ccgtccaggt cactatgttt gggcgaacca aaccgaattg 900 
gcacgttatc aaaaaggact aaactag - 927 

<210> 109 
<211> 1275 
<212>" DNA 

<213> Streptococcus pneumoniae 
<400> 109 

atgaagataa gttggaatgg attttctaaa aaatcatacc aagagcgcct cgagctgcta 60 
aaagctcagg cgctccttag tcctgagaga caagctagtc tggagaagga tgaacagatg 120 
agtgtgactg tggcagacca gctgagtgag aatgtggtgg gaactttttc tctgccttat 180 
tcgctggttc cggaggtact tgtcaacggt caggaataca ccgttcccta tgtgacagaa 240 
gaaccctctg tggttgcggc ggccagctat gccagcaaaa tcatcaagcg tgcaggtggt 300 
tttactgcac aagtccatca gcgccagatg attgggcagg tagcccttta tcaaattgct 360 
aatcctaaac tagcgcaaga gaagattgcc agcaagaaag cggagctctt ggagcttgcc 420 
aatcaagcct atccttctat cgttaaacgt gggggtgggg cgcgtgatct gcatgtcgag 480 
cagataaaag gcgaaccaga ctttctcgtt gtttatattc atgtcgatac ccaggaagcc 540 
atgggtgcca atatgctcaa caccatgctg gaagccttga aaccagtctt agaagaactc 600 
agtcagggac agagtctcat gggaatcctg tccaactacg cgactgattc tctggtgact 660 
gcaagctgtc gcatcgcctt tcgctacttg agccgccaaa aggatcaagg acgagagatt 720 
gcggagaaaa ttgcgttggc tagtcagttt gcgcaggctg atccttaccg agctgctact 780 
cataataaag gaatttttaa tggtattgat gcgattttga ttgccactgg taatgactgg 840 
cgtgccatcg aagctggggc ccatgccttt gccagtcgag atggacgcta tcaaggtctt 900 
agctgctgga cgctggacct tgaaagagaa gaattggtcg gtgagatgac cctgcccatg 960 
cctgtagcga ctaagggtgg ctctatcggc ctcaacccac gtgtagctct cagtcatgat 1020 
ctactaggaa atccttctgc cagagaatta gcccagatta tcgtgtccat cggtcttgct 1080 
caaaattttg cagccctcaa agccttggta agtacgggca tccagcaagg ccacatgaaa 1140 
ctacaggcca aatccctagc tctcctagct ggggctagtg aatctgaagt tgctccccta 1200 
gtagagcgcc tcatctcaga taaaaccttt aacctagaga cagcccagcg ctatctcgaa 1260 
aatttaagat cataa . 12 75 

<210> 110 
<211> 789 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 110 

atgccaatta catcatt'aga aataaaggac aagacttttg gaactcgatt cagaggtttt 60 
gatccagaag aagtegatga atttttagat attgtggttc gtgattacga agatcttgtg 120 



46 
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cgtgcgaatc atgataaaaa tttgcgtatt aagagtttag aagagcgttt gtcttacttt 180 
gatgaaataa aagattcatt gagccagtct gtattgattg ctcaggatao agctqJaaaa 240 
gtgaaacagg cggcgcatga acgttcaaac aatatcattc atcaagcaja «2S?a2a loo 
caa Cgcttgt tggaagaagc taaatataag gcaaacgaga ttcttcg^a SaacS? 360 
aatgctaaga aagtcgctgt tgaaacagaa gaattgaaga acaagagccg S *20 
caacgtctca aatctacaat tgagagtcag ttggctattg ttgaatcttc agaSgwIa 430 
gatattctcc gtccaacagc tacttatctt caaaccagtg atgaagccU ?2aaI2?S 540 
gttagcgaag tacttggaga accgattcca gctccaattg aagaagaacc aatSESS 600 
acacgtcagt tc.tctcaagc agaaatggca gaattacaag ctcgtattga SrtS™? tin 
aaagaattgt ctgaatttga agctcagatt aaacaggaag tggaagcJcc ' Z ill 

lZttfa C aagtt9aa9a aga9CCtCtg CtCatCC ^ ? IZIZ * ill 



789 



<210> 111 
15 <211> 1728 
<212> DNA 

<213> Streptococcus pneumoniae 



20 



<400> 111 

ta\ 9 at C a ta tL g lltlT^t ^"taatg «?t«tttt agttctggct 



. tatgtagtgg caatctttct acgtaagcga l^ ggg l £ a^aggc III g ™J £ 0 
Totllllll " 0tat *" a *<*tcc.gt. aatgatgaag ?.ga«gctgt KSaSg III 
trtlll 9 ga ° aaagtca agtggctttc cgtgaatgga atcaaaaatg ggtcgattta 240 
25 teJSSS tt^" 9CCga tat ^ a f aat -tctctttg aagcagaagg g ? a taacca a 

tcatttcgtt ttctcaaggc cagtcatcaa attgac.caaa ttgagagtca aattacttto 360 
attgaagaag atattgcggc aattcgcaat gctttggcag acttaglgaa gcaagaa"? 420 
aaaaatagtg gtcgtgttct tcatgctttg gatttatttg aggaacttc. SSJJgS til 
gctgaaaatt cagaacagta tggtcaagcc ttggatgaaa ttgaaaaaca atJagaaaat 540 
atccaatctg aattttcaca atttgtaacc ttgaattcat cgggtgaccc Jgtqqaaacc 600 
30 gcagtgattt tggataatac agaaaatcac attttggcct tSgtLS? tSgKcqt 660 
gttccagcct tggttacgac gctttctaca gaattgccag atcaattaca gga^tg^a 120 
gccggttatc gtaaactaat tgatgctaat tatcattttg ttgaaacgga tattgaaqca 7«o 
cgtttccact tgctttatga agcattcaag aaaaaccaag agaatattcg tca^tggaa 840 
35 lltlttlttl ° Cgaatatga Wggacag gcacaagagg aaatcaatgc cttg^gat 

iSScrfS ? agaaa " gC t^ctcagaaa gtagtggaaa atctacttgc aac?cttcca 960 
acttaccttc aacatatgaa agagaataat actttattgg gagaagatat tgcacqttta 1020 
llZaalttl 3 " taCttCC twcagrt gcaagccatg t'tcgtcgtat SjSS? 10S0 
ttagagagtt ttgaggcagc tattgttgag gtaacttcaa atcaagaaga accaacccaa 1140 
gcttattcag ttcttgaaga aaatcttgag gatttacaaa ctcaactaaa agaSSJ" 12SS 
40 gatgagcaaa tttcagttag tgagcgcctg acacaaattg agaaagatga tattaaSca 1260 
cgtcaaaagg ccaatgttta tgtcaatcgt ctccatacta tcaagcgata cato«2S 1320 
cgcaatctgc caggtattcc acaaactttc ttgaagttat tcttLcggc aJaSataa? lllo 
accgaggatt taatggttga gttagaacaa aaaatgatta acattga^c tgS^S 1440 
45 lit?? 3 *** " g " acgaa tgatatggaa gctttagaaa cggaaactta taatattg?a 1500 
45 caatatgcaa ctttgacaga goaactcttg caatattcta accgctatcg ctcatttea? 1560 
gaacgcattc aagaagcatt taacgaagct ttagatattt ttgaaaaaga atttgattit lltl 

cgc\ 9 ttatt cct 9a ata a t 9a a "ggaagtgg cgagcctSg t^Sc^ 6 0 ° 

cgctttgtta cctcatatga gaaaacacgt gaaacgattc gtttttaa 1728 

50 <210> 112 
<211> 2403 
<212> DNA 

<213> Streptococcus pneumoniae 
55 <400> 112 

aL t3 ! aaatg f taaaagaa ttggtggaca ttgatgtgcc atcacaagag 60 • 
ttggctgaaa aaatgtoaac tacaggaatc gaggtagagg gtgtcgaatc accagctgct 120 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



ggtctctcaa aaattgtcgt cggtgaggtc ttgtcttgcg aagatgtgcc agagactcac 180 
ctccatgttt gtcaggttaa cgttggcgaa gaagagcgtc agatcgtttg tggtgcccca 240 
aatgtgcgtg ctgggatcaa ggtcatggtg gctcttccag gagctcgtat cgctgataac 300 
tacaaaatca aaaaaggaaa aatccgtggt ttggagtcac ttggaatgat ctgttcactt 360 
ggtgaattgg gaatttctga ctcagttgtg cctaaggaat tcgcagatgg catccaaatc 420 
ttgcctgaag atgccgtgcc aggtgaggaa gtcttttctt acctagactt ggatgatgaa 480 
atcatcgaac tttccatcac accaaaccgt gcagatgccc tttctatgtg tggagtggct 540 
cacgaagtgg cagccatcta tgacaaggca gtcaacttta aagaatttac tctaacagaa 600 
actaatgaag ctgcggcaga tgccctttct gtcagcattg agacagacaa ggcgccttac 660 
tatgcagctc gtatcttgga caatgtgacc atcgcaccaa gtccacaatg gttgcaaaac 720 
cttctcatga acgaaggaat ccgtcccatc aataacgtag tggacgtgac caactacatc 780 
ctgctctatt ttggtcaacc aatgcatgcc tttgacttgg ataactttga agggactgac 840 
atccgtgtgc gtgaagcgcg tgctggtgaa aaattggtga ccttggacgg tgaa'gaacgt 900 
gacttggacg tgaatgacct agtcatcact gtcgcagaca agccagtagc ccttgcaggt 960 
gtcatgggtg gtcaagcaac agaaatctct gaaaaatcta gtcgtgttgt ccttgaagct 1020 
gctgttttca atggcaaatc tatccgtaag acaagtggtc gcctgaacct tcgttctgag 1080 
tcatcttctc gctttgaaaa aggaattaat gtggcaacag ttaatgaagc ccttgatgcg 1140 
gcagctagcc tgattgcgga acttgcaggt gcgacggtgc gtaagggcat cgtttcagcg 1200 
ggtgagcttg atacttcaga tgtagaagtt tcttcaaccc ttgctgatgt taaccgtgtc 1260 
ctcggaactg agctgtctta tgctgatgta gaagacgtct tccgtcgtct tggctttggt 1320 
ctttctggaa atgcagacag ctttacagtc agagtcccac gtcgtcgttg ggatatcaca 1380 
atcgaagctg acctctttga agaaattgct cgtatctatg gttatgaccg cttgccaact 1440 
agtctaccaa aagacgatgg tacagcaggt gaattgacag ccacacaaaa actccgccgt 1500 
caagttcgta ctattgctga aggagcaggt ttgacagaaa tcatcaccta tactctaaca 1560 
actcctgaaa aagcagttga gtttacggct caaccaagta accttacgga actcatgtgg 1620 
ccaatgactg tggatcgttc agtcctccgt caaaatatga tttcaggtat ccttgatacc 1680 
gttgcctaca acgtggctcg taagaataaa aacttggccc tttacgagat tggaaaagtc 1740 
tttgaacaaa caggtaatcc aaaagaagaa cttccaaatg aaatcaacag ttttgccttt 1800 
gccttgacag gcttggttgc tgaaaaagat ttccaaacag cagcagttcc agttgatttc 1860 
ttctatgcta agggaatcct tgaagcccta tttactcgtt tgggactcca agtaacctat 1920 
acagcaacat ctgaaatcgc tagccttcat ccaggtcgta cagccgtgat ttcactcggt 1980 
gaccaagttc ttggtttcct tggccaagtg catccagtca ctgccaaggc ttacgatatt 2040 
ccagaaacgt atgtggctga gcttaacctt tcagctatcg aagctgcgct tcagccagcg 2100 
actccatttg tagaaatcac caaattcccg g'cagtcagcc gtgacgttgc ccttctcctc 2160 
aaggcagaag tgactcatca agaagttgta gatgctatcc aagctgccgg cgtgaaacgt 2220 
ttgacagata tcaaactctt tgacgtcttc tcaggtgaga aattgggact tggtatgaag 2280 
tcaatggctt atagcttgac cttccaaaat ccagaagata gcttaacgga cgaagaagtc 234 0 
gcacgctata tggaaaaaat ccaagcatcg ctcgaagaaa aagtcaatgc agaagtgcgt 2400 



taa 

<210> 113 
<211> 543 
<212> DNA 

<213> Streptococcus pneumoniae 



2403 



<400> 113 

atgttagaaa acgatattaa aaaagtcctc gtttcacacg atgaaattac agaagcagct 60 
aaaaaactag gtgctcaatt aactaaagat tatgcaggaa aaaatccaat cttagttggg 120 
attttaaaag gatctattcc ttttatggct gaattggtca aacatattga tacacatatt 180 
gaaatggact tcatgatggt ttctagctac catggtggaa cagcaagtag tggtgttatc 240 
aatattaaac aagatgtgac tcaagatatc aaaggaagac atgttctatt tgtagaagat 300 
atcattgata caggtcaaac tttgaagaat ttgcgagata tgtttaaagc aagagaagca 360 
gcttctgtta aaattgcaac cttgttggat aaaccagaag gacgtgttgt agaaattgag 420 
gcagactata cctgctttac tatcccaaat gagtttgtag taggttatgg tttagactac 480 
55 aaagaaaatt atcgtaatct tccttatatt ggagtattga aagaggaagt gtattcaaat 540 
ta ? 543 



48 



WO 01/49721 



PCT/US00/35604 



<210> 114 
<211> 235 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 114 

Met lie Tyr Ala Gly He Leu Ala Gly Gly Thr Gly Thr Arg Met Gly' 
15 10 15 

•10 He Ser Asn Leu Pro Lys Gin Phe Leu Glu Leu Gly Asp Arg Pro He 
20 25 30 

Leu He His Thr He Glu Lys Phe Val Leu Glu Pro Ser lie Glu Lys 
35 40 45 

15 

He Val Val Gly Val His Gly Asp Trp Val Leu His Ala Glu Asp Leu 
50 55 60 

Val Asp Lys Tyr Leu Pro Leu His Lys Glu Arg He He lie Thr Lvs 
20 65 70 75 80 

Gly Gly Ala Asp Arg Asn. Thr Ser lie Glu Asn lie lie Glu Ala lie 
85 '90 95 

25 Asp Ala Tyr Arg Pro Leu Thr Pro Glu Asp lie Val Val Thr His Asp 
100 105 no 

Ser Val Arg Pro Phe lie Thr .Leu Arg Met He Gin Asp Ser He Lys 
115 120 125 

30 

Leu Ala Gin Asn His Asp Ala Val Asp Thr Val Val Glu Ala Val Asp 
130 135 140 

' Thr He Val Glu Ser Thr Asn Gly Gin Phe lie Thr Gly lie Pro Asn 
35 145 150 155 160 

Arg Ala His Leu Tyr Gin Gly Gin Thr Pro Gin Thr Phe Arg Cys Lys 
165 -170 175 

40 Asp Phe Met Asp Leu Tyr Gly Ser Leu Ser Asp Glu Glu Lys Glu lie 
180 185 190 

Leu Thr Asp Ala Cys Lys lie Phe Val lie Lys Gly Lys Asp Val Ala 
195 200 205 

45 

Leu Ala Lys Gly Glu Tyr Ser Asn Leu Lys lie Thr Thr Val Thr Asp 
210 215 220 

Leu Lys lie Ala Lys Ser Met lie Glu Lys Asp 
50 225 230 235 

<210> 115 
<211> 185 
55 <212> PRT 

<213> Streptococcus pneumoniae 
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<400> 115 

Met Ala Asn Val He He Glu Lys Ala Lys Glu Arg Met Thr Gin Ser 
1 5 10 15 

5 His Gin Ser Leu Ala Arg Glu Phe Gly Gly He Arg Ala Gly Arq Ala 
20 25 30 

Asn Ala Ser Leu Leu Asp Arg Val His Val Glu Tyr Tyr Gly Val Glu 
35 40 45 

10 

Thr Pro Leu Asn Gin He Ala Ser He Thr He Pro Glu Ala Arg Val 
50 55 60 

Leu Leu Val Thr Pro Phe Asp Lys Ser Ser Leu Lys Asp He Glu Arg 
15 65 70 75 80 

Ala Leu Asn Ala Ser Asp Leu Gly He Thr Pro Ala Asn Asp Gly Ser 
85 90 95 

20 Val He Arg Leu Val He Pro Ala Leu Thr Glu Glu Thr Arg Arg Asp 
100 105 no- 

Leu Ala Lys Glu Val Lys Lys Val Gly Glu Asn Ala Lys Val Ala Val 
25 ll $ 120 125 

Arg Asn He Arg Arg Asp Ala Met Asp Glu Ala Lys Lys Gin Glu Lys 
130 135 140 

Ala Gin Glu He Thr Glu Asp Glu Leu Lys Thr Leu Glu Lys Asp He 
30 14 5 150 155 160 

Gin Lys Val Thr Asp Asp Ala Val Lys His He Asp Asp Met Thr Ala 
165 170 175 

35 Asn Lys Glu Lys Glu Leu Leu Glu Val 
180 185 



<210> 116 
40 <211> 450 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 116 

45 Met Gly Lys Tyr Phe Gly Thr Asp Gly Val Arg Gly Glu Ala Asn Leu 
1 5 .10 15 

Glu Leu Thr Pro Glu Leu Ala Phe Lys "Leu Gly Arg Phe Gly Gly Tyr 

50 

Val Leu Ser Gin His Glu Thr Glu Ala Pro Lys Val Phe Val Gly Arg 

35 40 45 

Asp Thr Arg lie Ser Gly Glu Met Leu Glu Ser Ala Leu Val Ala Glv 
55 50 55 60 

Leu Leu Ser Val Gly lie His Val Tyr Lys Leu Gly Val Leu Ala Thr 



50 



WO 01/49721 



PCT/US00/35604 



65 



70 



75 



80 



Pro Ala Val Ala Tyr Leu Val Glu Thr Glu Gly Ala Ser Ala Gly Val 
85 90 95 

5 

Met He Ser Ala Ser His Asn Pro Ala Leu Asp Asn Gly He Lys Phe 
100 105 no 

Phe Gly Gly Asp Gly Phe Lys Leu Asp Asp Glu Lys Glu Ala Glu He 
10 115 120 125 

Glu Ala Leu Leu Asp Ala Glu Glu Asp Thr Leu Pro Arg Pro Ser Ala 
130 135 140 

15 Glu Gly Leu Gly He Leu Val Asp Tyr Pro Glu Gly Leu Ara Lvs Tvr 
145 150 155 160 



20 



Glu Gly Tyr Arg Val Ser Thr Gly Thr Pro Leu Asp Gly Met Lys Val 
165 170 175 

Ala Leu Asp Thr Ala Asn Gly Ala Ala Ser Thr Ser Ala Arg Gin He 
180 185 190 



Phe Ala Asp Leu Gly Ala Gin Leu Thr Val He Gly Glu Thr Pro Asp 
25 195 200 205 

Gly Leu Asn lie Asn Leu Asn Val Gly Ser Thr His Pro Glu Ala Leu 
210 215 220 

30 Gin Glu Val Val Lys Glu Ser Gly Ser Ala He Gly Leu Ala Phe Asp 
225 230 .235 240 



35 



Gly Asp Ser Asp Arg Leu He Ala Val Asp Glu Asn Gly Asp He Val 
245 250 . 255 

Asp Gly Asp Lys He Met Tyr He lie Gly Lys Tyr Leu Ser Glu Lys 
260 265 . 270 



. Gly Gin Leu Ala Gin Asn Thr He Val Thr Thr Val Met Ser Asn Leu 
40 275 280 285 

Gly Phe His Lys Ala Leu Asn Arg Glu Gly He Asn Lys Ala Val Thr 
290 295 300 . 

45 Ala Val Gly Asp Arg Tyr Val Val Glu Glu Met Arg Lys Ser Gly Tyr 
305 310 315 320 



50 



Asn Leu Gly Gly Glu Gin Ser Gly His -Val He Leu Met Asp Tyr Asn 
325 330 335 

Thr Thr Gly Asp Gly Gin Leu Ser Ala Val Gin Leu Thr Lys He Met 
340 345 350 



Lys Glu Thr Gly Lys Ser Leu Ser Glu Leu Ala Ala Glu Val Thr He 
55 355 360 365* 

Tyr Pro Gin Lys Leu Val Asn He Arg Val Glu Asn Val Met Lys Glu 
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370 375 380 

Lys Ala Met Glu Val Pro Ala He Lys Ala He He Glu Lys Met Glu 
5 385 390 395 400 

Glu Glu Met Ala Gly Asn Gly Arg lie Leu Val Arg Pro Ser Gly Thr 
405 410 415 

Glu Pro Leu Leu Arg Val Met Ala Glu Ala Pro Thr Thr Glu Glu Val 
10 420 425 430 

Asp Tyr Tyr Val Asp Thr He Thr Asp Val Val Arg Ala Glu He Gly 
435 440 445 

15 lie Asp 
450 



<210> 117 
20 <211> 234 
<212> PRT 

<213> Streptococcus pneumoniae 
<40'0> 117 

25 Met Lys Lys He Leu lie Val Asp Asp Glu Lys Pro lie Ser Asp He 
1 5 ,10 15 

lie Lys Phe Asn Met Thr Lys Glu Gly Tyr Glu Val Val Thr Ala Phe 
30 20 

Asn Gly Arg Glu Ala Leu Glu Gin Phe Glu Ala Glu Gin Pro Asp lie 
35 40 45 

He lie Leu Asp Leu Met Leu Pro Glu lie Asp Gly Leu Glu Val Ala 
35 50 55 60 

Lys Thr He. Arg Lys Thr Ser Ser Val Pro lie Leu Met Leu Ser Ala 
65 70 75 80 

40 Lys Asp Ser Glu Phe Asp Lys Val lie Gly Leu Glu Leu Gly Ala Asp 

85 90 95 

Asp Tyr Val Thr Lys Pro Phe Ser Asn Arg Glu Leu Gin Ala Arg Val 
100 105 no 

45 

Lys Ala Leu Leu Arg Arg Sef Gin Pro Met Pro Val Asp Gly Gin Glu 
115 120 125 

Ala Asp Ser Lys Pro Gin Pro lie Gin lie Gly Asp Leu Glu He Val 
50 130 135 140 

Pro. Asp Ala Tyr Val Ala Lys Lys Tyr Gly Glu Glu Leu Asp Leu Thr 
145 150 155 160 

55 His Arg Glu Phe Glu Leu Leu Tyr His Leu Ala Ser His Thr Gly Gin 

165 170 175 
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Val He Thr Arg Glu His Leu Leu Glu Thr Val Trp Gly Tyr Asp Tyr 
180 185 190 

Phe Gly ,Asp Val Arg Thr Val Asp Val Thr Val Arg Arg Leu Arg Glu 
5 195 , 200 205 

Lys He Glu Asp Thr Pro Ser Arg Pro Glu Tyr He Leu Thr Arg Arg 
210 215 220 

10 Gly Val Gly Tyr Tyr Met Arg Asn Asn Ala 
225 230 



<210> 118 

15 <211> 368 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 118 

20 Met Glu Glu He Leu Cys He Gly Cys Gly Ala Thr He Gin Thr Thr 
1.5. 10 15 

Asp Lys Ala Gly Leu Gly Phe Thr Pro Gin Ser Ala Leu Glu Lys Gly 

25 

Leu Glu Thr Gly Glu Val Tyr Cys Gin Arg Cys Phe Arg Leu Arg His 
35 40 45 

Tyr Asn Glu He Thr Asp Val Gin Leu Thr Asn Asp Asp Phe Leu Lvs 
30 50 55 60 

Leu Leu His Glu Val Gly Asp Ser Asp Ala Leu Val Val Asn Val He 
65 70 75 so 

35 Asp He Phe Asp Phe Asn Gly Ser Val lie Pro Gly Leu Pro Arg Phe 

85 .90 95 

Val Ser Gly Asn Asp Val Leu Leu Val Gly Asn Lys Lys Asp He Leu 
n 100 105 HO 

40 

Pro Lys Ser Val Lys Ser Gly Lys He Ser Gin Trp Leu Met Lys Arg 
115 120 125 

Ala His Glu Glu Gly Leu Arg Pro Val Asp Val Val Leu Thr Ser Ala 
45 130 135 140 

Gin Asn Lys His Ala He Lys Glu Val He Asp Lys He Glu His Tyr 
145 150 155 160 

50 Arg Lys Gly Arg Asp Val Tyr Val Val Gly Val Thr Asn Val Gly Lys 

165 . 170 175 

Ser Thr Leu lie Asn Ala lie He Gin Glu He Thr Gly Asp Gin Asn 
180 185 190 

55 

Val ne T hr Thr Ser Arg Phe Pro Gly Thr Thr Leu' Asp Lys He Glu 
195 200 205 



53 



WO 01/49721 



PCT/US00/35604 



He Pro Leu Asp Asp Gly Ser Tyr He Tyr Asp Thr Pro Gly He He 
210 215 220 

5 His Arg His Gin Met Ala His Tyr Leu Thr Ala Lys Asn Leu Lvs Tvr 
225 230 235 2 40 



10 



Val Ser Pro Lys Lys Glu He Lys Pro Lys Thr Tyr Gin Leu Asn Pro 

245 250 255 

Glu Gin Thr Leu Phe Leu Gly Gly Leu Gly Arg Phe Asp Phe He Ala 
260 265 270 



Gly Glu Lys Gin Gly Phe Thr Ala Phe Phe Asp Asn Glu Leu Lys Leu 
15 275 280 285 

His Arg Ser Lys Leu Glu Gly Ala Ser Ala Phe Tyr Asp Lys His Leu 
290 295 300 

20 Gly Thr Leu Leu Thr Pro Pro Asn Ser Lys Glu Lys Glu Asp Phe Pro 
305 310 315 320 



25 



Arg Leu Val Gin His Val Phe Thr He Lys Asp Lys Thr Asp Leu Val 
325 330 335 

He Ser Gly Leu Gly Trp He Arg Val Thr Gly Thr Ala Lys Val Ala 
340 345 350 



Val Trp Ala Pro Glu Gly Val Ala Val Val Thr Arg Lys Ala He He 
30 355 360 365 



35 

<210> 119 
<211> 486 
<212> PRT 

<213> Streptococcus pneumoniae 

40 

<400> 119 

Met Tyr Pro Asp Asp Ser Leu Thr Leu His Thr Asp Leu Tyr Gin He 
15 10 15 

45 Asn Met Met Gin Val Tyr Phe Asp Gin Gly He His Asn Lys Lys Ala 
20 - 25 30 

Val Phe Glu Val Tyr Phe Arg Gin Gin Pro Phe Lys Asn Gly Tyr Ala 
5Q 35 40 45 

Val Phe Ala Gly Leu Glu Arg He Val Asn Tyr Leu Glu Asp Leu Ara 
50 55 60 

Phe Ser Asp Ser Asp He Ala Tyr Leu Glu Ser Leu Gly Tyr His Glv 
55 . 65 70 75 80 

Ala Phe Leu Asp Tyr Leu Arg Asn Phe Lys Leu Glu Leu Thr Val Arg . 
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85 



90 



95 



Ser Ala Gin Glu Gly Asp Leu Val Phe Ala Asn Glu Pro He Val Gin 

100 105 HO 

Val Glu Gly Pro Leu Ala Gin Cys Gin Leu Val Glu Thr Ala Leu Leu 
115 120 125 



10 



Asn He Val Asn Tyr Gin Thr Leu Val Ala Thr Lys Ala Ala Arg He 
130 135 140 



15 



20 



Arg Ser Val He Glu Asp Glu Pro Leu Met Glu Phe Gly Thr Arg Arg 
145 150 155 160 

Ala Gin Glu Thr Asp Ala Ala He Trp Gly Thr Arg Ala Ala Val He 
165 170 175 

Gly Gly Ala Asn Gly Thr Ser Asn Val Arg Ala Gly Lys Leu Phe Asp 
180 185 190 

He Pro Val Leu Gly Thr His Ala His Ala Leu Val Gin Val Tyr Gly 
195 200 205 



25 



Asn Asp Tyr Glu Ala Phe Lys Ala Tyr Ala- Ala Thr His Lys Asn Cys 
210 215 220 



30 



35 



Val Phe Leu Val Asp Thr Tyr Asp Thr Leu Arg He Gly Val Pro Ala 
225 230 235 240 

Ala He Gin Val Ala Arg Glu Leu Gly Asp Gin He Asn Phe Met Gly 
245 250 255 

Val Arg He Asp Ser Gly Asp He Ala Tyr He Ser Lys Lys Val Arg 
260 265 270 

Gin Gin Leu Asp Glu Ala Gly Phe Thr Glu Ala Lys He Tyr Ala Ser 
275 280 285 



40 



Asn Asp Leu Asp Glu Asn Thr lie Leu Asn Leu Lys Met Gin Lys Ala 
290 295 300 



45 



Lys He Asp Val Trp Gly Val Gly Thr Gin Leu He Thr Ala Tyr Asp 
305 310 315 320 

Gin Pro Ala Leu Gly Ala Val Tyr Lys He Val Ala He Glu Asp Glu 
325 330 . 335 



50 



Thr Gly Gin Met Arg Asn Thr He Lys- Leu Ser Asn Asn Ala Glu Lys 
340 345 350 

Val Ser Thr Pro Gly Lys Lys Gin Val Trp Arg He Thr Ser Arg Glu 
355 360 365 



55 



Lys' Gly Lys Ser Glu Gly Asp Tyr He Thr Tyr Asp Gly Val Asp He 
370 375 380 



Ser Asp Met Thr Glu He Lys Met Phe His Pro Thr Tyr Thr Tyr He 
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385 390 395 400 

Lys Lys Thr Val Arg Asn Phe Asp Ala Val Pro Leu Leu Val Asp lie 
405 410 415 

Phe Lys Glu Gly lie Leu Val Tyr Asn Leu Pro Ser Leu Thr Asp lie 
420 425 430 

Gin Asp Tyr Ala Arg Lys Glu Phe Asp Lys Leu Trp Asp Glu Tyr Lys 
10 435 440 445 

Arg Val Leu Asn Pro Gin His Tyr Pro Val Asp Leu Ala Arg Asp Val 
450 455 460 

15 Trp Gin Asp Lys Met Asp Leu He Asp Lys Met Arg Lys Glu Ala Leu 
465 470 475 480 

Gly Glu Gly Glu Glu Glu 
485 

20 

<210> 120 
<211> 283 
<212> PRT 
25 <213> Streptococcus pneumoniae 

<400> 120 

Met Ala Thr He Gin Trp Phe Pro Gly His Met Ser Lys Ala Arg Arg ' 

30 1 

Gin Val Gin Glu Asn Leu Lys Phe Val Asp Phe Val Thr He Leu Val 
20 25 30 

Asp Ala Arg Leu Pro Leu Ser Ser Gin Asn Pro Met Leu Thr Lys lie 
35 35 40 45 

Val Gly Asp Lys Pro Lys Leu Leu He Leu Asn Lys Ala Asp Leu Ala 
50 55 60 

40 Asp Pro Ala Met Thr Lys Glu Trp Arg Gin Tyr Phe Glu Ser Gin Gly 
65 70 75 el 

He Gin Thr Leu Ala lie Asn Ser Lys Glu Gin Val Thr Val Lys Val 
45 85 ' ' 90 95 

Val Thr Asp Ala Ala Lys Lys Leu Met Ala Asp Lys He Ala Arg Gin 
100 105 no 

Lys Glu Arg Gly He Gin He Glu Thr Leu Arg Thr Met He He Glv 
50 115 120 . 125 

He Pro Asn Ala Gly Lys Ser Thr Leu Met Asn Arg Leu Ala Gly Lvs 
130 135 140 

55 Lys He Ala Val Val Gly Asn Lys Pro Gly Val Thr Lys Gly Gin Gin 
145 150 155 160 
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Trp Leu Lys Thr Asn Lys Asp Leu Glu lie Leu Asp Thr Pro Gly He 
165 170 175 

Leu Trp Pro Lys Phe Glu Asp Glu Thr Val Ala Leu Lys Leu Ala Leu 
* 185 190 

Thr Gly Ala He Lys Asp Gin Leu Leu Pro Met Asp Glu Val Thr He 
195 200 205 

10 Phe Gly He Asn Tyr Phe Lys Glu His Tyr Pro Glu Lys Leu Ala Glu 
210 215 220 



15 



25 



30 



40 



Arg Phe Lys Gin Met Lys He Glu Glu Glu Pro Ser Val He He Met 

225 230 235 240 

Asp Met Thr Arg Ala Leu Gly Phe Arg Asp Asp Tyr Asp Arg Phe Tvr 

245 .250 255 



Ser Leu Phe Val Lys Glu Val Arg Asp Gly Lys Leu Gly Asn Tyr 



20 260 



265 270 



Thr 



Leu Asp Thr Leu Glu Asp Leu Asp Gly Asn Asp 
275 280 



<210> 121 
<211> 156 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 121 

Met He Asn Asn Val. Val Leu Val Gly Arg Met Thr Arg Asp Ala Glu 
1 5 . io 15 

35 Leu Arg Tyr Thr Pro Ser Asn Val Ala Val Ala Thr Phe Thr Leu Ala 
20 25 30 



Val Asn Arg Thr Phe Lys Ser Gin Asn Gly Glu Arg Glu Ala Asp Phe 
35 40 45 

He Asn Val Val Met Trp Arg Gin Gin Ala Glu Asn Leu Ala Asn Trp 
50 55 60 



Ala Lys Lys Gly Ser Leu He Gly Val Thr Gly Arg He Gin Thr Ara 
45 65 70 75 • 



80 



Ser Tyr Asp Asn Gin Gin Gly Gin Arg Val Tyr Val Thr Glu Val Val ■ 
85 90 95 

50 Ala Glu Asn. Phe Gin Met Leu Glu. Ser Arg Ser Val Arg Glu Gly His 
100 105 HO' 

Thr Gly Gly Ala Tyr Ser Ala Pro Thr Ala Asn Tyr Ser Ala -Pro Thr 
55 115 120 125 

Asn Ser Val Pro Asp Phe Ser Arg Asn Glu Asn Pro Phe Gly Ala Thr 
130 135 140 
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Asn Pro Leu Asp lie Ser Asp Asp Asp Leu Pro Phe 
150 155 



<210> 122 
<211> 324 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 122 

Met Lys Thr Arg He Thr Glu Leu Leu Lys He Asp Tyr Pro He Phe 
1 5 10 15 

15 Gin Gly Gly Met Ala Trp Val Ala Asp Gly Asp Leu Ala Gly Ala Val 
20 25 30 



10 



20 



35 



Ser Lys Ala Gly Gly Leu Gly He He Gly Gly Gly Asn Ala Pro Lys 
35 40 45 

Glu Val Val Lys Ala Asn He Asp Lys He Lys Ser Leu Thr Asp Lvs 
50 55 60 



25 65 



Pro Phe Gly Val Asn He Met Leu Leu Ser Pro Phe Val Glu Asp II 



e 



7 0 75 go 



Val Asp Leu Val He Glu Glu Gly Val Lys Val Val Thr Thr Gly Ala 
85 90 95 

30 Gly Asn Pro Ser Lys Tyr Met Glu Arg Phe His Glu Ala Gly lie He 
100 105 no 



Val He Pro Val Val Pro Ser Val Ala Leu Ala Lys Arg Met Glu Lys 
115 . 120 125 

He Gly Ala Asp Ala Val He Ala Glu Gly Met Glu Ala Gly Gly His 
130 135 140 



He Gly Lys Leu Thr Thr Met Thr Leu Val Arg Gin Val Ala Thr Ala 
40 14 5 150 155 .160 

He Ser He Pro Val lie Ala Ala Gly Gly lie" Ala Asp Gly Glu Gly 
165 170 175 

45 Ala Ala Ala Gly Phe Met Leu Gly Ala Glu Ala Val Gin Val Gly Thr 
180 185 190 

Arg Phe Val. Val Ala Lys Glu Sex Asn' Ala His Pro Asn Tyr Lys Glu 
5Q 200 205 

. Lys lie Leu Lys Ala Arg Asp He Asp Thr Thr lie Ser Ala Gin His 
• 210 215 220 

Phe Gly His Ala Val Arg Ala lie Lys Asn Gin Leu Thr Arg Asp Phe 
55 225 230 235 2 40 

Glu Leu Ala Glu Lys Asp Ala Phe Lys Gin Glu Asp Pro Asp Leu Glu 
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245 250 255 

He Phe Glu Gin Met Gly Ala Gly Ala Leu Ala Lys Ala Val Val His 
260 265 270 

Gly Asp Val Glu Gly Gly Ser Val Met Ala Gly Gin lie Ala Gly Leu 
275 280 * 285 

Val Ser Lys Glu Glu Thr Ala Glu- Glu lie Leu Lys Asp Leu Tyr Tvr 
10 290 295 300 

Gly Ala Ala Lys Lys He Gin Glu Glu Ala Ser Arg Trp Thr Gly Val 
305 310 315 320 

15 Val Arg Asn Asp 



<210> 123 
20 <211> 140 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 123 

25 Met He Asp He Gin Gly He Lys Glu Ala Leu Pro His Arg Tyr Pro 
1 5 io is 

Met Leu Leu Val Asp Arg Val Leu Glu Val Ser Glu Asp Thr lie Val 
20 25 30 

30 

Ala He Lys Asn Val Thr lie Asn Glu Pro Phe Phe Asn Gly His Phe 
35 . 40 * 45 

Pro Gin Tyr Pro Val Met Pro Gly Val Leu He Met Glu Ala Leu Ala 
35 50 55 60 

Gin Thr Ala Gly Val Leu Glu Leu Ser Lys Pro Glu Asn Lys Gly Lys 
6 5 70 75 80 

40 Leu Val Phe Tyr Ala Gly Met Asp. Lys Val Lys Phe Lys Lys Gin Val 

85 90 95 

Val Pro Gly Asp Gin Leu Val Met Thr Ala Thr Phe Val Lys Arg Arg 
100 105 no 

45 

Gly Thr He Ala Val Val Giu Ala Lys Ala Glu Val Asp Gly Lys Leu 
115 120 125 

Ala Ala Ser Gly Thr Leu Thr Phe Ala He Gly Asn 
50 130 135 140 



<210> 124 
<211> 340 
55 <212> PRT 

<213> Streptococcus pneumoniae 
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<400> 124 

Met He Asn Gin He Tyr Gin Leu Thr Lys Pro Lys Phe He Asn Val 
1 5 10 .15 

5 Lys Tyr Gin Glu Glu Ala He Asp Gin Glu Asn His He Leu He Arg 
20 25 30 



10 



Pro Asn Tyr Met Ala Val Cys His Ala Asp Gin Arg Tyr Tyr Gin Gly 
35 40 45 

Lys Arg Asp Pro Lys He Leu Asn Lys Lys Leu Pro Met Ala Met He 
50 55 60 



His Glu Ser Cys Gly He Val He Ser Asp Pro Ser Gly Thr Tyr Glu 
15 65 70 75 so 

Val Gly Gin Lys Val Val Met He Pro Asn Gin Ser Pro Met Gin Ser 
85 90 95 

20 Asp Glu Glu Phe Tyr Glu Asn Tyr Met Thr Gly Thr His Phe Leu Ser 
100 105 no 



25 



Ser Gly Phe Asp Gly Phe Met Arg Glu Phe Val Ser Leu Pro Lys Asp 

115 120 125 

Arg Val Val Ala Tyr Asp Ala He Glu Asp Thr Val Ala Ala He Thr 
130 135 140 



Glu Phe Val Ser Val Gly Met His Ala Met Asn Arg Leu Leu Thr Leu 
30 145 150 155 160 

Ala His Ser Lys Arg Glu Arg He Pro Val He Gly Asp Gly Ser Leu 
165 170 175 

35 Ala Phe Val Val Ala Asn He He Asn Tyr Thr Leu Pro Glu Ala Glu 
180 185 190 



40 



He Val Val He Gly Arg His Trp Glu Lys Leu Glu Leu Phe Ser Phe 
195 200 205 

Ala Lys Glu Cys Tyr He Thr Asp Asn lie Pro Glu Glu Leu Ala Phe 
210 215 • 220 



Asp His Ala Phe Glu Cys Cys Gly Gly Asp Gly Thr Gly Pro Ala He 
45 225 230 235 240 

Asn Asp Leu He Arg Tyr He Arg Pro Gin Gly Thr He Leu Met Met 
245 --250 255 

50 Gly Val Ser Glu Tyr Lys Val Asn Leu Asn Thr Arg Asp Ala Leu Glu 
260 265 270 



55 



Lys Gly Leu Leu Leu Val Gly Ser Ser Arg Ser Gly Arg He Asp Phe 
275 280 285 

Glu Asn Ala He Gin Met Met Lys Val Lys Lys Phe Ala Asn Arg Leu 
290 295 300 



60 



WO 01/49721 



PCT/US00/35604 



Lys Asn He Leu Tyr Leu Glu Glu Pro Val Arg Glu He Lys Asp He 

305 310 315 320 

5 His Arg Val Phe Ala Thr Asp Leu Asn Thr Ala Phe Lys Thr Val Phe 

325 330 335 



10 



Lys Trp Glu Val 
340 



<210> 125 

<211> 447 

<212> PRT 

15 <213> Streptococcus pneumoniae 



20 



<400> 125 

Met Asn Leu Lys Thr Thr Leu Gly Leu Leu Ala Gly Arg Ser Ser His 
15 10 15 

Phe Val Leu Ser Arg Leu Gly Arg Gly Ser Thr Leu' Pro Glv Lvs Val 
20 25 30 



Ala Leu Gin Phe Asp, Lys Asp He Leu Gin Asn Leu Ala Lys Asn Tyr 
^ 35 40 45 

Glu He Val Val Val Thr Gly Thr Asn Gly Lys Thr Leu Thr Thr Ala 
50 55 60 

30 Leu Thr Val Gly He Leu Lys Glu Val Tyr Gly Gin Val Leu Thr Asn 
65 . 70 75 80 



35 



Pro Ser Gly Ala Asn Met He Thr Gly lie Ala Thr Thr Phe Leu Thr 
85 90 95 

Ala Lys Ser Ser Lys Thr Gly Lys Asn lie Ala Val Leu Glu He Asp 
100 105 no 



Glu Ala Ser Leu Ser Arg He Cys Asp Tyr He Gin Pro Ser Leu Phe 
4U H5 120 125 

Val He Thr Asn He Phe Arg Asp Gin Met Asp Arg Phe Gly Glu He 
130 135 140 * 

45 Tyr Thr Thr Tyr Asn Met He Leu Asp Ala He Arg Lys Val Pro Thr 
145 150 155 160 

Ala Thr Val Leu Leu Asn Gly Asp Ser . Pro Leu Phe Tyr Lys Pro Thr 

50 

lie Pro Asn Pro lie Glu Tyr Phe Gly Phe Asp Leu Glu Lys Gly Pro 
180 185 190 

Ala Gin Leu Ala His Tyr Asn Thr Glu Gly lie Leu Cys Pro Asp Cvs 
55 195 200. 205 

Gin Gly He Leu Lys Tyr Glu His Asn Thr Tyr Ala Asn Leu Gly Ala 
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210 215 220 



Tyr He Cys Glu Gly Cys Gly Cys Lys Arg. Pro Asp Leu Asp Tyr Arg 

5 



225 230 235 240 



Leu Thr Lys Leu Val Glu Leu Thr Asn Asn Arg Ser Arg Phe Val He 
245 250 255 

Asp Gly Gin Glu Tyr Gly He Gin lie Gly Gly Leu Tyr Asn He Tyr 
1U 260 265 270 

Asn Ala Leu Ala Ala Val Ala He Ala Arg Phe Leu Gly Ala Asp Ser 
275 280 285 

15 Gin Leu He Lys Gin Gly Phe Asp Lys Ser Arg Ala Val Phe Glv Ara 
290 295 300 



Gin Glu Thr Phe His lie Gly Asp Lys Glu Cys Thr Leu Val Leu He 

20 



305 310 315 320 



Lys Asn Pro Val Gly Ala Thr Gin Ala He Glu Met He Lys Leu Ala 
325 330 335 

Pro Tyr Pro Phe Ser Leu Ser Val Leu Leu Asn -Ala Asn Tyr Ala Asp 
25 340 345 350 

Gly He Asp Thr Ser Trp He Trp Asp Ala Asp Phe Glu Gin He Thr 
355 360 365 

30 Asp Met Asp He Pro Glu He Asn Ala Gly Gly Val Arg His Ser Glu 
370 375 380 



He Ala Arg Arg Leu Arg Val Thr Gly Tyr Pro Ala Glu Lys lie Thr 
35 385 390 395 400 

Glu Thr Ser Asn Leu Glu Gin Val Leu Lys Thr He Glu Asn Gin Asp 
405 410 415 



Cys Lys His Ala Tyr He Leu Ala Thr Tyr Thr Ala Met Leu Glu Phe 
40 420 425 430 

Arg Glu Leu Leu Ala Ser Arg Gin He Val Arg Lys Glu Met Asn 
435 440 445 

45 

<210> 126 
<211> 260 
<212> PRT 
^ <213> Streptococcus pneumoniae 

<400> 126 

Met Val Tyr Thr Ser Leu Ser Ser Lys Asp Gly Asn Tyr Pro Tyr Gin 
15 10 15 

55 Leu Asn He Ala His Leu Tyr Gly Asn.Leu Met Asn Thr Tyr Gly Asp 
20 25 30 
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Asn Gly Asn lie Leu Met Leu Lys Tyr Val Ala Glu Lys Leu Gly Ala 
35 40 45 

His Val Thr Val Asp He Val Ser Leu His Asp Asp Phe Asp Glu Asn 
5 50 55 60 

His Tyr Asp He Ala Phe Phe Gly Gly Gly Gin Asp Phe Glu Gin Ser 
65 70 75 eo 

10 He He Ala Asp Asp Leu Pro Ala Lys Lys Glu Ser He Asp Asn Tyr 

85 90 95 



15 



He Gin Asn Asp Gly Val Val Leu Ala He Cys Gly Gly Phe Gin Leu 
100 105 no 

Leu Gly Gin Tyr Tyr Val Glu Ala Ser Gly Lys Arg He Glu Gly Leu 
115 120 125 



Gly Val Met Gly His Tyr Thr Leu Asn Gin Thr Asn Asn Arg Phe lie 
20 . 130 135 140 

Gly Asp He Lys He His Asn Glu Asp Phe Asp Glu Thr Tyr Tyr Gly 
14 5 150 155 160 

25 Phe Glu Asn His Gin Gly Arg Thr Phe Leu Ser Asp Asp Gin Lys Pro 

165 170 175 



30 



Leu Gly Gin Val Val Tyr Gly Asn Gly Asn Asn Glu Glu Lys Val Gly 
180 185 190 

Glu Gly Val His Tyr. Lys Asn Val Phe Gly Ser Tyr Phe His Gly Pro 
195 200 205 



He Leu Ser Arg Asn Ala Asn Leu Ala Tyr Arg Leu Val Thr Thr Ala 
35 210 215 220 

" Leu Lys Lys Lys Tyr Gly Gin Asp He Gin Leu Pro Ala Tyr Glu Asp 
225 230 235 240 

40 -He Leu Ser Gin Glu He Ala Glu Glu Tyr Ser Asp Val Lys Ser Lys 

245 . 250 255 

Ala Asp Phe Ser 
260 

45 

<210> 127 
<211> 223 
<212> PRT 
50 <213> Streptococcus pneumoniae 

<400> 127 

Met Asn Val Lys Glu Asn Thr Glu Leu Val Phe Arg Glu Val Ala Glu 

55 /■ 

Ala Ser Leu Ser Ala Asn Arg Glu Ser Gly Ser Val Ser Val lie Ala 
20 25 30 



63 



WO 01/49721 



PCT/US00/35604 



Val Thr Lys Tyr Val Asp Val Pro Thr Ala Glu Ala Leu Leu Pro Leu 
35 40 45- 

5 Gly Val His His lie Gly Glu Asn Arg Val Asp Lys Phe Leu Glu Lys 
50 55 60 

Tyr Glu Ala Leu Lys Asp Arg Asp Val Thr Trp His Leu lie Gly Thr 

65 7 0 75 Rn 

10 80 

Leu Gin Arg Arg Lys Val Lys Asp Val He Gin Tyr Val Asp Tyr Phe 
85 90 95 

His Ala Leu Asp Ser Val Lys Leu Ala Gly Glu He Gin Lys Arq Ser 
15 100 105 ilo 

Asp Arg Val He Lys Cys Phe Leu Gin Val Asn He Ser Lys Glu Glu 
115 120 125 

20 Ser Lys His Gly Phe Ser Arg Glu Glu Leu Leu Glu He Leu Pro Glu 
13 ° 135 140 



25 



50 



Leu Ala Gly Leu Asp Lys He Glu Tyr Val Gly Leu Met Thr Met Ala 
145 150 155 160 

Pro Phe Glu Ala Ser Ser Glu Gin Leu Lys Glu He Phe Lys Ala Ala 
165 170 175 



Gin Asp Leu Gin Arg Glu He Gin Glu Lys Gin He Pro Asn He Pro 
3U 180 185 190 

Met Thr Glu Leu Ser Met Gly Met Ser Arg Asp Tyr Lys Glu Ala He 
195 200 205 

35 Gin Phe Gly Ser Thr Phe Val Arg He Gly Thr Ser Phe Phe Lys 
210 215 220 



<210> 128 
40 <211> 279 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 128 

45 Met Gly He Ala Leu Glu Asn Val Asn Phe Thr Tyr Gin Glu Gly Thr 
1 5 10 15 



Pro Leu Ala Ser Ala Ala Leu Ser Asp Val Ser Leu Thr He Glu Asp 
20 25 30 

Gly Ser Tyr Thr Ala Leu He Gly His Thr Gly Ser Gly Lys Ser Thr 
35 40 45 



He .Leu Gin Leu Leu Asn Gly Leu Leu Val Pro Ser Gin Gly Ser Val 
55 50 55 60 

Arg Val Phe Asp Thr Leu lie Thr Ser Thr Ser Lys Asn Lys Asp He 
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65 7 0 75 go 

Arg Gin He Arg Lys Gin Val Gly Leu Val Phe Gin Phe Ala Glu Asn 
85 90 95 

Gin He Phe Glu Glu Thr Val Leu Lys Asp Val Ala Phe Gly Pro Gin 
100 105 no 

Asn Phe Gly Val Ser Glu Glu Asp Ala Val Lys Thr Ala Arq Glu Lvs 
10 115 120 125 

Leu Ala Leu Val Gly He Asp Glu Ser Leu Phe Asp Arg Ser Pro Phe 
130 135 140 

15 • Glu Leu Ser Gly Gly Gin Met Arg Arg Val Ala He Ala Gly He Leu 
145 150 155 i 6 o 

Ala Met Glu Pro Ala He Leu Val Leu Asp Glu Pro Thr Ala Gly Leu 

165 170 175 

20 . 

Asp Pro, Leu Gly Arg Lys Glu Leu Met Thr Leu Phe Lys Lys Leu His 
180 185 190 

Gin Ser Gly Met Thr He Val Leu Val Thr His Leu Met Asp Asp Val 
25 195 200 - 205 " 

Ala Glu Tyr Ala Asn Gin Val Tyr Val Met Glu Lys Gly Arg Leu Val 
210 215 220 

30 Lys Gly Gly Lys Pro Ser Asp Val Phe Gin Asp Val Val Phe Met Glu 
22 5 - . 230 235 240 

Glu Val Gin Leu Gly Val Pro Lys He Thr Ala Phe Cys Lys Arg Leu 

35 245 250 

Ala Asp Arg Gly Val Ser Phe Lys Arg Leu Pro Val Lys lie Glu Glu 
260 ... 265 270 

Phe Lys Glu Ser Leu Asn Gly 
40 275 



<210> 129 
<211> 309 
45 <212> PRT 

<213> Streptococcus pneumoniae 



<400> 129 

Met Asp He Gin Phe Leu Gly Thr Gly Ala Gly Gin Pro Ser Lys Ala 
50 1 5 io , 15 



Arg Asn Val Ser Ser Leu Ala Leu Lys Leu Leu Asp Glu He Asn Glu 
20 . 25 30 

55 Val Trp Leu Phe Asp Cys Gly Glu Gly Thr Gin Asn Arg He Leu Glu 
35 40 45 
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Thr Thr He Arg Pro Arg Lys Val Ser Lys lie Phe He Thr His Leu 
50 55 60 

His Gly Asp His He Phe Gly Leu Pro Gly Phe Leu Ser Ser Arg Ala 
3 65 - 70 75 80 

Phe Gin Ala Asn Glu Glu Gin Thr Asp Leu Glu He Tyr <31y Pro Gin 
85 go g5 

10 Gly He Lys Ser Phe Val Leu Thr Ser Leu Arg Val Ser Gly Ser Arg 
100 105 no 

Leu Pro Tyr Arg lie His Phe His Glu Phe Asp Gin Asp Ser Leu Glv 
15 115 120 125 

Lys lie Leu Glu lie Asp Lys Phe Thr Val Tyr Ala Glu Glu Leu Asp 
130 135 140 ^ 

20 ut Thr 116 Phe CyS HI Gly Tyr Arg Val *H Gln Lys As P Leu G1 * 

.Gly Thr Leu Asp Ala Glu Lys Leu Lys Ala Ala Gly Val Pro Phe Gly 
165 170 175 

25 Pro Leu Phe Gly Lys He Lys Asn Gly Gin Asp Leu Val Leu Glu Aso 
180 185 190- 

Gly Thr Glu He Lys Ala Ala Asp Tyr lie Ser Ala Pro Arg Pro Glv ' 
30 195 200 205 

Lys lie He Thr He Leu Gly Asp Thr Arg Lys Thr Asp Ala Ser Val 
210 215 220 

Q< ™? LSU Ala Val Asn Ala Asp Val Leu Val His Glu Ser Thr Tyr Glv 
225 230 235 --- 



240 



Lys Gly Asp Glu Lys He Ala Arg Asn His Gly His Ser Thr Asn Met 
245 250 255 

40 Gin Ala Ala Gin Val Ala Val Glu Ala Gly Ala Lys Arg Leu Leu Leu 
260 265 270 

Asn His He Ser Ala Arg Phe Leu Ser Lys Asp He Ser Lys Leu Lvs 
45 275 280 285 

lys Asp Ala Ala Thr He Phe Glu Asn Val His Val Val Lys Asp Leu 
290 295 300 • 

Glu Glu Val Glu He 
50 305 



<210> 130 
<211> 553 
55 <212> PRT 

<213> Streptococcus pneumoniae 
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<400> 130 

Met Ser Asn lie Ser Leu Thr Thr Leu Gly Gly Val Arg Glu Asn Gly 
5 10 15 

5 Lys Asn Met Tyr lie Ala Glu lie Gly Glu Ser lie Phe Val Leu Asn 



20 25 



30 



15 



Val Gly Leu Lys Tyr Pro Glu Asn Glu Gin Leu Gly Val Asp Val Val 

10 40 45 

He Pro Asn Met Asp Tyr Leu Phe Glu Asn Ser Asp Arg lie Ala Gly 
3U 55 60 

Val Phe Leu Thr His Gly His Ala Asp Ala He Gly Ala Leu Pro Tyr 
" 70 75 l 0 

Leu Leu Ala Glu Ala Lys Val Pro Val- Phe Gly Ser Glu Leu Thr lie 
85 9° 95 

20 Glu Leu Ala Lys Leu Phe Val Lys Gly Asn Asp Ala Val Lys Lys Phe 

Asn Asp Phe His Val lie Asp Glu Asn Thr Glu lie Asp Phe Gly Gly 
25 120 125 

Thr val Val Ser Phe Phe Pro Thr Thr Tyr Ser Val Pro Glu Ser Leu 

135 140 



30 



Gly lie Val Leu Lys Thr Ser Glu Gly Ser lie Val Tyr Thr Gly Asp 
5 150 155 1{ * 

Phe Lys Phe Asp Gin Thr Ala Ser Glu Ser Tyr Ala Thr Asp Phe Ala 
165 170 175 

35 Arg Leu Ala Glu He Gly Arg Asp Gly Val Leu Ala Leu Leu Ser Asp 

180 ioc r 



Ser Ala Asn Ala Asp Ser Asn lie Gin Val Ala Ser Glu Ser Glu Val 
4 0 195 200 205 

Arg Asp Glu lie Thr Gin Thr lie Ala Asp Trp Glu Gly Arg lie lie 
ZAU 215 220 

45 225 Ma 230~ ~~ Gln Ile ' PhS ASP 

Ala Ala Asp Lys Thr Gly Arg Arg lie Val Leu Thr Gly Phe Asp Ile 
245 



235 



240 



250 255 



50 Glu Asn Ile Val Arg Thr Ala He Arg Leu Lys Lys Leu Ser Leu Ala 
260 265 270 

Asn Glu lie Leu Leu Ile Lys Pr p Lys Asp M et Ser Arg Phe Glu Asp 
275 280 285 



55 



His Glu Leu He Ile Leu Glu Thr Gly Arg Met Gly Glu Pro lie Asn 
zyu 295 300 
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Gly Leu Arg Lys Met Ser He Gly Arg His Arg Tyr Val Glu lie Lvs 
305 310 315 320 

5 Asp Gly Asp Leu Val Tyr lie Ala Thr Ala Pro Ser He Ala Lys Glu 
325 330 ' 335 

Ala Phe Val Ala Arg Val Glu Asn Met He' Tyr Gin Ala Gly Gly Val 
340 345 350 

10 

Val Lys Leu He Thr Gin Ser Leu His Val Ser Gly His Gly Asn Val 
355 360 365 

Arg Asp Leu Glri Leu Met He Asn Leu Leu Gin Pro Lys Tyr Leu Phe 
15 370 375 380 

Pro Val Gin Gly Glu Tyr Arg Glu Leu Asp Ala His Ala Lys Ala Ala 
385 390 395 400 

20 Met Ala Val Gly Met Leu Pro Glu Arg He Phe He Pro Lys Lys Gly 

405 410 415 

Thr Thr Met Ala Tyr Glu Asn Gly Asp Phe Val Pro Ala Gly Ser Val 
25 420 425 430 

Ser Ala Gly Asp He Leu He Asp Gly Asn Ala He Gly Asp Val Gly 
435 440 445 

Asn Val Val Leu Arg Asp Arg Lys Val Leu Ser Glu Asp Gly He Phe 
30 450 455 460 

He Val Ala He Thr Val Asn Arg Arg Glu Lys Lys He Val Ala Arg 
465 470 475 480 

35 Ala Arg Val His Thr Arg Gly Phe Val Tyr Leu Lys Lys Ser Arg Asp 

485 490 495 

He Leu Arg Glu Ser Ser Glu Leu He Asn Gin Thr Val Glu Asp Tyr 
4Q 500 505 510 

Leu Gin Gly Asp Asp Phe Asp Trp Ala Asp Leu Lys Gly Lys Val Arg 
515 520 525 

Asp Asn Leu Thr Lys Tyr Leu Phe Asp Gin Thr Lys Arg Arg Pro -Ala 
45 530 535- 540 

He Leu Pro Val Val Met Glu Ala Lys 
545 550 

50 

<210> 131 
<211> 316 
. <212> PRT ; 
^ <213> Streptococcus pneumoniae 

<400> 131 

Met Thr Lys Glu Phe His His Val Thr Val Leu Leu His Glu Thr He 
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15 10 15 

Asp Met Leu Asp Val Lys Pro Asp Gly He Tyr Val Asp Ala Thr Leu 



20 



25 



30 



Gly Gly Ala Gly His Ser Glu Tyr Leu Leu Ser Lys Leu Ser Glu Lvs 
35 40 45 

Gly His Leu Tyr Ala Phe Asp Gin Asp Gin Asn Ala He Asp Asn Ala 
10 50 55 60 



Gin .Lys Arg Leu Ala Pro Tyr He Glu Lys Gly Val Val Thr Phe lie 

65 ™ IS 80 

15 Lys Asp Asn Phe Arg His Leu Gin Ala Arg Leu Arg Glu Ala Gly Val 

85 90 95 



20 



Gin Glu He Asp Gly He Cys Tyr Asp Leu Gly Val Ser Ser Pro Gin 
100 105 no 

Leu Asp Gin Arg Glu Arg Gly Phe Ser Tyr Lys Lys Asp Ala Pro Leu 
115 120 125 



. Asp Met Arg Met Asn Gin Asp Ala Ser Leu Thr Ala Tyr Glu Val Val 
25 130 135 140 

Asn His Tyr Asp Tyr His Asp Leu Val Arg lie Phe Phe Lys Tyr Gly 
145 150 155 . . i 6 o 

30 Glu Asp Lys Phe Ser Lys Gin He Ala Arg Lys He Glu Gin Ala Arg 

165 170 175 



35 



Glu Val Lys Pro He Glu Thr Thr Thr Glu Leu Ala Glu He He Lys 

180 185 190 

Leu Val Lys Pro Ala Lys Glu Leu Lys Lys Lys Gly His Pro Ala Lys 
195 200 205 



Gin He Phe Gin Ala He Arg He Glu Val Asn Asp Glu Leu Gly Ala 
40 210 • 215 



220 



Ala Asp Glu Ser He Gin Gin Ala Met Asp Met Leu Ala Leu Asp Gly 
225 230 235 240 

45 Arg He Ser Val He Thr Phe His Ser Leu Glu Asp Arg Leu Thr Lys 

245 250 255 



50 



Gin Leu Phe Lys Glu Ala Ser Thr Val" Glu Val Pro Lys Gly Leu Pro 
260 265 270 

Phe He Pro Asp Asp Leu Lys Pro Lys Met Glu Leu Val Ser Arg Lys 
275 280 285 



Pro He Leu Pro Ser Ala Glu Glu Leu Glu Ala Asn Asn Arg Ser His 
55 290 295 300 

Ser Ala Lys Leu Arg Val Val Arg Lys He His Lys 
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305 



310 



315 



<210> 132 
<211> 332 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 132 

10 Met Ser Arg He Leu Asp Asn Glu He Met Gly Asp Glu Glu Leu Val 
1 5 10 15 

Glu Arg Thr Leu Arg Pro Gin Tyr Leu Arg Glu Tyr He Gly Gin Aso 
20 25 30 



15 



Lys Val Lys Asp Gin Leu Gin He Phe He Glu Ala Ala Lys Met Ara 
35 40 45 



30 



Asp Glu Ala Leu Asp His Val Leu Leu Phe Gly Pro Pro Gly Leu Glv 
20 50 '55 60 

Lys Thr Thr Met Ala Phe Val He Ala Asn Glu Leu Gly Val Asn Leu 
65 70 75 80 

25 Lys Gin Thr Ser Gly Pro Val He Glu Lys Ala Gly Asp Leu Val Ala 

85 90 95 

.lie Leu Asn Glu Leu Glu Pro Gly Asp Val Leu Phe He Asp Glu He 
100 105 no 

His Arg Leu Pro Met Ser Val Glu Glu Val Leu Tyr Ser Ala Met Glu 
115 120 125 

Asp Phe Tyr He Asp lie Met He Gly Ala Gly Glu Gly Ser Arq Ser 
35 130 * 135 140 

Val His Leu Glu Leu Pro Pro Phe Thr Leu He Gly Ala Thr Thr Arq 
145 150 155 leo 

40 Ala Gly Met Leu Ser Asn Pro Leu Arg Ala Arg Phe Gly He Thr Gly 

165 170 175 

His Met Glu Tyr Tyr Ala His Ala Asp Leu Thr. Glu He Val Glu Arg 
180 185 190 

45 

Thr- Ala Asp lie Phe Glu Met Glu He Thr His Glu Ala Ala Ser Glu 
195 200 205 

Leu Ala Leu Arg Ser Arg Gly Thr Pro Arg lie Ala Asn Arg Leu Leu 
50 210 215 220 

Lys Arg Val Arg Asp Phe Ala' Gin lie Met Gly Asn Gly Val lie Asn 
225 230 235 240 

55 Asp He He Thr Asp Lys Ala Leu Thr Met Leu Asp Val Asp His Glu 

245 250 - 255 
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Gly Leu Asp Tyr Val Asp Gin Lys lie Leu Arg Thr Met lie Glu Met 
260 265 270 

Tyr Ser Gly Gly Pro Val Gly Leu Gly Thr Leu Ser Val Asn He Ala 
5 275 280 285 

Glu Glu Arg Glu Thr Val Glu Asp Met Tyr Glu Pro Tyr Leu He Gin 
290 295 300 

10 Lys Gly Phe He Met Arg Thr Arg Ser Gly Arg Val Ala Thr Ala Lys 
305 310 315 320 

Ala Tyr Glu His Leu Gly Tyr Glu Tyr Ser Glu Lys 
325 330 

15 

<210> 133 
<211> 436 
<212> PRT 
20 <213> Streptococcus pneumoniae 

<400> 133 

Met Ser Met Phe Leu Asp Thr Ala Lys He Lys Val Lys Ala Gly Asn 

25 1 5 10 15 

Gly Gly Asp Gly Met Val Ala Phe Arg Arg Glu Lys Tyr Val Pro Asn 
20 25 30 

Gly Gly Pro Trp Gly Gly Asp Gly Gly Arg Gly Gly Asn Val Val Phe 
- 30 35 40 45 

Val Val Asp Glu Gly Leu Arg Thr Leu Met Asp Phe Arg Tyr Asn Arg 
50 55 60 

35 His Phe Lys Ala Asp Ser Gly Glu Lys Gly Met Thr Lys Gly Met His 
65 70 75 80 

Gly Arg Gly Ala Glu Asp Leu Arg Val Arg Val Ser Gin Gly Thr Thr 

40 85 90 95 

Val Arg Asp Ala Glu Thr Gly Lys Val Leu Thr Asp Leu He Lys His 
10Q 105 no 

Gly Gin Glu Phe lie Val Ala His Gly Gly Arg Gly Gly Arg Glv Asn 
45 115 120 125 

He Arg Phe Ala Thr Pro Lys Asn Pro Ala Pro Glu He Ser Glu Asn 
130 135 140 

50 Gly Glu Pro Gly Gin Glu Arg Glu Leu Gin Leu Glu Leu Lys He Leu 
145 150 155 160 

Ala Asp Val Gly Leu Val Gly Phe Pro Ser Val Gly Lys Ser Thr Leu 
55 I 65 170 175 

Leu Ser Val lie Thr Ser Ala Lys Pro Lys He Gly Ala Tyr His Phe 
180 185 . 190 
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Thr Thr He Val Pro -Asn Leu Gly Met Val Arg Thr Gin Ser Gly Glu 
195 200 205 

5 Ser Phe Ala Val Ala Asp Leu Pro Gly Leu He Glu Gly Ala Ser Gin 
210 215 220 

Gly Val Gly Leu Gly Thr Gin Phe Leu Arg His He Glu Arg Thr Arq 
1Q 225 230 235 2 40 

Val He Leu His He He Asp Met Ser Ala Ser Glu Gly Arg Asp Pro 
245 250 255 

Tyr Glu Asp Tyr Leu Ala He Asn Lys Glu Leu Glu Ser Tyr Asn Leu 
15 260 265 270 

Arg Leu Met Glu Arg Pro Gin He He Val Ala Asn Lys Met Asp Met 
275 280 285 

20 Pro Glu Ser Gin Glu Asn Leu Glu Glu Phe Lys Lys Lys Leu Ala Glu 
290 295 300 



25 



Asn Tyr Asp Glu Phe Glu Glu Leu Pro Ala He Phe Pro He Ser Gly 
305 310 315 320 

Leu Thr Lys Gin Gly Leu Ala Thr Leu Leu Asp Ala Thr Ala Glu Leu 
325 330 335 



Leu Asp Lys Thr Pro Glu Phe Leu Leu Tyr Asp Glu Ser Asp Met Glu 
30 340 345 350 

Glu Glu Ala Tyr Tyr Gly Phe Asp Glu Glu Glu Lys Ala Phe Glu He 
355 360 365 

35 Ser Arg Asp Asp Asp Ala Thr Trp Val Leu Ser Gly Glu Lys Leu Met 
370 375 380 



40 



Lys Leu Phe Asn Met Thr Asn Phe Asp Arg Asp Glu Ser Val Met Lys 
3 *5 390 395 4 00 

Phe Ala Arg Gin Leu Arg Gly Met Gly Val Asp Glu Ala Leu Arg Ala 
405 410 415 



Arg Gly Ala Lys Asp Gly Asp Leu Val Arg He Gly Lys Phe Glu Phe 
45 420 425 430 

Glu Phe Val Asp 
435 

50 

<210> 134 
<211> 172 
<212> PRT 
55 <213> stre Ptococcus pneumoniae 

<400> 134 

Met Asn Tyr Phe. Asn Val Gly Lys He Val Asn Thr Gin Gly Leu Gin 
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15 10 15 

Gly Glu Met Arg Val Leu Ser Val Thr Asp Phe Ala Glu Glu Ara Phe 
5 20 25 30 

Lys Lys Gly Ala Glu Leu Ala Leu Phe Asp Glu Lys Asp Gin Phe Val 
35 40 45 

Gin Thr Val Thr He Ala Ser His Arg Lys Gin Lys Asn Phe Asp He 
10 50 55 60 

He Lys Phe Lys Asp Met Tyr. His He Asn Thr He Glu Lys Tyr Lvs 
65 70 . 75 ^ 80 

15 Gly Tyr Ser Leu Lys Val Ala Glu Glu Asp Leu Asn Asp Leu Asp Asp 

85 90 95 

Gly Glu Phe Tyr Tyr His Glu He He Gly Leu Glu Val Tyr Glu Gly 
2Q " 100 105 no 

Asp Ser Leu Val Gly Thr He Lys Glu He Leu Gin Pro Gly Ala Asn 
115 120 125 

Asp Val Trp Val Val Lys Arg Lys Gly Lys Arg Asp Leu Leu Leu Pro 
*5 130 135 * 140 

Tyr He Pro Pro Val Val Leu Asn Val Asp He Pro Asn Lys Ara Val 
145 150 155 . 160 

30 Asp Val Glu lie Leu Glu Gly Leu Asp Asp Glu Asp 

165 170 

<210> 135 
35 <211> 239 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 135 

40 Met Lys He Asp He Leu Thr Leu Phe Pro Glu Met Phe Ser Pro Leu 
1 5 10 — 



15 



Glu His Ser He Val Gly Lys Ala Arg Glu Lys Gly Leu Leu Asp He 
45 20 25 30 

Gin Tyr His Asn Phe Arg Glu Asn Ala Glu Lys Ala Arg His Val Asp 
35 40 45 

Asp Glu Pro Tyr Gly Gly Gly Gin Gly Met Leu Leu Arg Ala Gin Pro 
50 50 55 60 

He Phe Asp Ser Phe Asp Ala lie Glu Lys Lys Asn Pro Arg Val He 
65 70 -75 . so 

55 Leu Leu Asp Pro Ala Gly Lys Gin Phe Asp Gin Ala Tyr Ala Glu Asp 

85 90 95 
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Leu Ala Gin Glu Glu Glu Leu He Phe He Cys Gly His Tyr Glu Glv 

100 105 110 

Tyr Asp Glu Arg He Lys Thr Leu Val Thr Asp Glu He Ser Leu Gly 
-> 115 120 125 

Asp Tyr' Val Leu Thr Gly Gly Glu Leu Ala Ala Met Thr Met He Asd 
130 135 140 

10 Ala Thr Val Arg Leu He Pro Glu Val He Gly Lys Giu Ser Ser His 
145 150 155 160 



15 



Gin Asp Asp Ser. Phe Ser Ser Gly Leu Leu Glu Tyr His Gin Tyr Thr 
165 • 170 175 

Arg Pro Tyr Asp Tyr Arg Gly Met Val Val Pro Asp Val Leu Met Ser 
180 185 190 



Gly His His Glu Lys He Arg Gin Trp Arg Leu Tyr Glu Ser Leu Lys 
20 195' 200 205 

Lys Thr Tyr Glu Arg Arg Pro Asp Leu Leu Glu His Tyr Gin Leu Thr 
210 215 220 

25 Val Glu Glu Glu Lys Met Leu Ala Glu He Lys Glu Asn Lys Glu 
225 230 235 



30 



<210> 136 
<211> 186 
<212> PRT 
35 <213> Streptococcus pneumoniae 



<400> 136 

Met He Glu Ala Ser Lys Leu Lys Ala Gly Met Thr Phe Glu Thr Ala 
40 1 5 10 is 

Asp Gly Lys Leu He Arg Val Leu Glu Ala Ser His His Lys Pro Gly 
20 25 30 

45 Lys Gly Asn Thr lie Met Arg Met Lys Leu Arg Asp Val Arg Thr Gly 
35 40 . .45 



50 



Ser Thr Phe Asp Thr Ser Tyr Arg Pro Glu Glu Lys Phe Glu Gin Ala 
50 55 60 

He He Glu Thr Val Pro Ala Gin Tyr Leu Tyr Lys Met Asp Asp Thr 
65 70 75 80 



Ala Tyr Phe Met Asn Thr Glu Thr Tyr Asp Gin Tyr Glu He Pro Val 
bt> 85 90 95 

Val Asn Val Glu Asn Glu Leu Leu Tyr He Leu Glu Asn Ser Asp Val 
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100 105 110 

Lys He Gin' Phe Tyr Gly Thr Glu Val He Gly Val Thr Val Pro Thr 
115 . 120 125 

Thr Val Glu Leu Thr Val Ala Glu Thr Gin Pro Ser He Lys Gly Ala 
130 135 140 

Thr Val Thr Gly Ser Gly Lys Pro Ala Thr Met Glu Thr Gly Leu Val 
145 150 155 160 

Val Asn Val Pro Asp Phe He Glu Ala Gly Gin Lys Leu Val He Asn 
165" 170 175 

Thr Ala Glu Gly Thr Tyr Val Ser Arg Ala 
180 185 



<210> 137 
<211> 523 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 137 

Met Ala Phe Glu Ser Leu Thr Glu Arg Leu Gin Asn Val Phe Lys Asn 
1 5 10 15 

Leu Arg Lys Lys Gly Lys He Ser Glu Ser Asp Val Gin Glu Ala Thr 
20. 25 30 

Lys Glu lie Arg Leu Ala Leu Leu Glu Ala Asp Val Ala Leu Pro Val 
35 40 45 

Val Lys Asp Phe He Lys Lys Val Arg Glu Arg Ala Val Gly His Glu 
50 55 60 

Val He Asp Thr Leu Asn Pro Ala Gin Gin He He Lys He Val Asp 
65 70 75 80 

Glu Glu Leu Thr Ala Val' Leu Gly Ser Asp Thr Ala Glu He He Lys 
85 90 / 95 

Ser Pro Lys He Pro Thr He He Met Met Val Gly Leu Gin Gly Ala 
100 105 HO 

Gly Lys Thr Thr Phe Ala Gly Lys Leu Ala Asn Lys Leu Lys Lys Glu 
115 120 ' 125 

Glu Asn Ala Arg Pro Leu Met He Ala Ala Asp lie Tyr Arg Pro Ala 
130 135 140 

Ala lie Asp Gin Leu Lys Thr Leu Gly Gin Gin He Asp Val Pro Val 
145 150 155. 160 

Phe Ala Leu Gly Thr Glu Val Pro Ala Val Glu lie Val Arg Gin Gly 
165 170 175 
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Leu Glu Gin Ala Gin Thr Asn His Asn Asp Tyr Val Leu He Asp Thr 
180 185 190 



5 Ala Gly Arg Leu Gin He Asp Glu Leu Leu Met Asn 



195 



200 



Glu Leu Arg Asp 
205 



10 



15 



Val Lys Thr Leu Ala Gin Pro Asn Glu He Leu Leu Val Val Asd Ala 
210 215 220 

Met He Gly Gin Glu Ala Ala Asn Val Ala Arg Glu Phe Asn Ala Gin 
225 23 0 235 240 

Leu Glu Val Thr Gly Val lie Leu Thr Lys He Asp Gly .Asp Thr Arg 



245 



250 



255 



Gly Gly Ala Ala Leu Ser Val Arg His He Thr Gly Lys Pro He Lys 
260 265 270 

20 Phe Thr Gly Thr Gly Glu Lys He Thr Asp He Glu Thr Phe His Pro 
275 280 285 



25 



Asp Arg Met Ser Ser Arg lie Leu Gly Met Gly Asp Met Leu Thr Leu 
290 295 300 

He Glu Lys Ala Ser Gin Glu ,Tyr Asp Glu Gin Lys Ala Leu Glu Met 
305 310 315 320 



Ala Glu Lys Met Arg Glu Asn Thr Phe Asp Phe Asn Asp Phe lie Asp 
325 330 335 

Gin Leu Asp Gin Val Gin Asn Met Gly Pro Met Glu Asp Leu Leu Lys 



340 



345 



350 



35 Met lie Pro Gly Met Ala Asn Asn Pro Ala Leu Gin Asn Met Lys Val 
355 360 365 



40 



45 



Asp Glu Arg Gin He Ala Arg Lys Arg Ala He Val Ser Ser Met Thr 
370 375 380 

Pro Glu Glu Arg Glu Asn Pro Asp Leu Leu Asn Pro Ser Arq Ara Ara 
385 390 395 40 0 



Arg lie Ala Ala Gly Ser Gly Asn Thr Phe Val Glu Val Asn Lys 



405 



410 



Phe 



415 



lie Lys Asp Phe Asn Gin Ala Lys Gin Leu Met Gin Gly Val Met Ser 
420 425" 430 

.50 Gly Asp Met Asn Lys Met Met Lys Gin Met Gly lie Asn Pro Asn Asn 
435 440 445 



55 



Leu Pro Lys Asn Met Pro Asn Met Gly Gly Met Asp Met Ser Ala Leu 
450 455 460 

Glu Gly Met Met Gly Gin Gly Gly Met Pro Asp Leu Ser Ala Leu Glv 
465 470 475 480 
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Gly Ala Gly Met Pro Asp Met Ser Gin Met Phe Gly Gly Gly Leu Lys 
485 490 495 

5 Gly Lys lie Gly Glu Phe Ala Met Lys Gin Ser Met Lys Arg Met Ala 
500 505 510 

Asn Lys Met Lys Lys Ala Lys Lys Lys Arg Lys 
515 520 

10 

<210> 138 
<211> 281 
<212> PRT 
15 <213> Streptococcus pneumoniae 

<400> 138 

Met Tyr Leu lie Glu lie Leu Lys Ser He Phe Phe Gly He Val Glu 
1 5 10 is 

20 

Gly He Thr Glu Trp Leu Pro He Ser Ser Thr Gly His Leu He Leu 
20 25 30 

Ala Glu Glu Phe He Gin Tyr Gin Asn Gin Asn Glu Ala Phe Met Ser 
25 35 40 45 

Met Phe Asn Val Val He Gin Leu Gly Ala He Leu Ala Val Met Val 
50 55 60 

30 He Tyr Phe Asn Lys Leu Asn. Pro Phe Lys Pro' Thr Lys Asp Lys Gin 
65 70 75 80 

Glu Val Arg Lys Thr Trp Arg Leu Trp Leu Lys Val Leu He Ala Thr 
85 90 95 

35 

Leu Pro Leu Leu Gly Val Phe Lys Phe Asp Asp Trp Phe Asp Thr His 
100 105 HO 

Phe His Asn Met Val Ser Val Ala Leu Met Leu He He Tyr Gly Val 
40 * 115 120 125 

Ala Phe He Tyr Leu Glu Lys Arg Asn Lys Ala Arg Ala He Glu Pro 
130 135 140 

45 Ser Val Thr Glu Leu Asp Lys Leu Pro Tyr Thr Thr Ala- Phe Tyr He 
145 150 155 160 

Gly Leu Phe Gin Val Leu Ala Leu Leu" Pro Gly Thr Ser Arg Ser Gly 
165 170 175 

50 

Ala Thr He Val Gly Gly Leu Leu Asn Gly Thr Ser Arg Ser Val Val 
180 185 190 

Thr Glu Phe Thr Phe Tyr Leu Gly He Pro Val Met Phe Gly Ala Ser 
55 195 200 205 

Ala Leu Lys He Phe Lys Phe Val Lys Ala Gly Glu Leu Leu Ser Phe 
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210 215 220 

Gly Gin Leu Phe Leu Leu Leu Val Ala Met Gly Val Ala Phe Ala Val 

5 



225 230 235 240 



Ser Met Val Ala He Arg Phe Leu Thr Ser Tyr Val Lys Lys His Asp 
245 250 255 

Phe Thr Leu Phe Gly Lys Tyr Arg He Val Leu Gly Ser Val Leu Leu 
10 260 265 270 

Leu Tyr Ser Phe Val Arg Leu Phe Val 
275 280 

15 

<210> 139 
<211> 429 
<212> PRT 

<213> Streptococcus pneumoniae 

20 

<400> 139 

Met Gly Leu Phe Asp Arg Leu Phe Gly Lys Lys Glu Glu Pro Lys He 
1 5 io 15 

25 Glu Glu Val Val Lys Glu Ala Leu Glu Asn Leu Asp Leu Ser Glu Asd 
20 25 30 

He Glu Pro Ala Phe Thr Glu Ala Glu Glu Val Ser Gin Glu Glu Ala 
35 40 45 

30 ^ 

Glu Val Glu Ser Ser Glu Glu Ser Val Phe Gin Glu Glu Asp Ser Gin 
50 55 60 

Asp Thr Val Glu Glu Asn Leu Asp Leu Glu Pro Val Val Glu Val Ser 
35 65 70 75 80 

Gin Glu Glu Val Glu Glu Phe Pro Asn Ser Gin Glu Val Thr Glu Glu 
85 90 95 

40 Glu Lys Leu Glu His Glu Gly" Thr Val Glu Glu Asn Asn Phe Glu Val 
100 ^ 105 no 

Leu Glu Pro Glu Ala Pro Gin Thr Glu Glu Thr Val Gin Glu Lys Tvr 
45 "5 120 125 

Asp Arg Ser Leu Lys Lys Thr Arg Thr Gly Phe Gly Ala Arg Leu Asn 
130 135 140 

Ala Phe Phe Ala Asn Phe Arg Ser Val Asp Glu Glu Phe " Phe Glu Glu 
50 145 150 155 i 6 o 

Leu Glu Glu Leu Leu -He Met Ser Asp Val Gly Val Gin Val Ala Ser 
165 170 175 

55 Asn Leu Thr Glu Glu Leu Arg Tyr Glu Ala Lys Leu Glu Asn Ala Lys 
180 185 190 
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Lys Pro Asp Ala Leu Arg Arg Val He He Glu Lys Leu Val Glu Leu 
195 200 205 

Tyr Glu Lys Asp Gly Ser Tyr Asp Glu Ser lie His Phe Gin Asp Asn 
5 210 215 220 

Leu Thr Val Met Leu Phe Val Gly Val Asn Gly Val Gly Lys Thr Thr 
225 ' 230 235 240 

10 Ser He Gly Lys Leu Ala His Arg Tyr Lys Arg Ala Gly Lys Lys Val 

245 250 255 



15 



Met Leu Val Ala Ala Asp Thr Phe Arg Ala Gly Ala Val Ala Gin Leu 
260 265 270 

Ala Glu Trp Gly Arg Arg Val Asp Val Pro Val Val Thr Gly Pro Glu 
275 280 285 



Lys Ala Asp Pro Ala Ser Val Val Phe Asp Gly Met Glu Arg Ala Val 
20 290 295 300 

Ala Glu Gly He Asp He Leu Met He Asp Thr Ala Gly Arg Leu Gin 
305 - 310 315 320 

25 Asn Lys Asp Asn Leu Met Ala Glu Leu Glu Lys He Gly Arg He He 

325 330 335 



30 



Lys Arg Val Val Pro Glu Ala Pro His Glu Thr Phe Leu Ala Leu Asp 
34 0 . 345 350 

Ala Ser Thr Gly Gin Asn Ala Leu Val .Gin Ala Lys Glu Phe Ser Lys 
355 360 365 



He Thr Pro Leu Thr Gly He Val Leu Thr Lys He Asp Gly Thr Ala 
35 370 375 380 

Arg Gly Gly Val Val Leu Ala He Arg Glu Glu Leu Asn He Pro Val 
385 390 .395 400 

40 Lys Leu He Gly Phe Gly Glu Lys He Asp Asp He Gly Glu . Phe Asn 

405 410 4 15 

Ser Glu Asn Phe Met Lys Gly Leu Leu Glu Gly Leu He 
420 425 

45 

<210> 140 
<211> 165 
<212> PRT 
50 <213> Streptococcus pneumoniae 

<400> 140 * 
Met Tyr He Glu Met Val Asp Glu Thr Gly Gin Val Ser Lys Glu Met 

55 1 

Leu Gin Gin Thr Gin Glu He Leu Glu Phe Ala Ala Lys Lys Leu Gly 
20 25 30 
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Lys Glu Asp Lys Glu Met Ala Val Thr Phe Val Thr Asn Glu Arg Ser 
35 40 45 

5 His Glu Leu Asn Leu Glu Tyr Arg Asp Thr Asp Arg Pro Thr Asd Val 
5 ° 55 60 

He Ser Leu Glu Tyr Lys Pro Glu Leu Glu He Ala Phe Asp Glu Glu 

10 65 

Asp Leu Leu Glu Asn Pro Glu Leu Ala Glu Met Met Ser Glu Phe Asp 
85 90 .95 

Ala Tyr lie Gly Glu Leu Phe lie Ser lie Asp Lys .Ala His Glu Gin 

1!> ioo 105 1I0 

Ala Glu Glu Tyr Gly His Ser Phe Glu Arg Glu Met Gly Phe Leu Ala 
115 120 125 

20 Val His Gly Phe Leu His He Asn Gly Tyr Asp His Tyr Thr Pro Glu 
130 135 140 



25 



Glu Glu Ala Glu Met Phe Gly Leu Gin Glu Glu He Leu Thr Ala Tyr 
145 150 155 i6 0 

Gly Leu Thr Arg Gin 
165 



30 <210> 141 
<211> 255 
<212> PRT 

<213> Streptococcus pneumoniae 
35 <400> 141 

Met Ser He Arg Val lie He Ala Gly Phe Lys Gly Lys Met Gly Gin 
15 io 



15 



Ala Ala Cys Gin Met Val Leu Thr Asp Pro Asp Leu Asp Leu Val Ala 
40 20 25 30. 

Val Leu Asp Pro Phe Glu Ser Glu Ser Glu Trp Gin Gly He Pro Val 
35 - 40 45 

45 Phe Lys Asp Lys Ala Asp Leu Ala Gly Phe Glu Ala Asp Val Tro Val 
50 55 * 



60 



Asp Phe Thr Thr Pro Ala Val Ala Tyr Glu Asn Thr Arg Phe Ala Leu 
50 65 70 75 ' 

Glu Asn Gly Phe Ala Pro Val Val Gly Thr Thr Gly Phe Thr Ser Glu 
85 90 95 

Glu He Ala Glu Leu Lys Glu Phe Ser Arg Ala Gin Asp Leu Gly Gly 
D:> 100 105 no 

Leu He Ala Pro Asn Phe Ala Leu Gly Ala Val Leu Leu Met Gin Phe 
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120 125 

Ala Thr Gin Ala Ala Lys Tyr Phe Pro Asn Val Glu He He Glu Leu 
130 135 140 

His His Asp Lys Lys Lys Asp Ala Pro Ser Gly Thr Ala He Lys Thr 
145 150 155 i 60 



Ala Glu Leu Met Ala Glu Val Arg Glu Ser He Gin Gin Gly Ala Ala 

175 



10 165 170 



Asp Glu Glu Glu Leu He Ala Gly Ala Arg Gly Ala Asp Phe Asp' Gly 
180 185 190 

15 Met Arg He His Ser Val Arg Leu Pro Gly Leu Val Ala His Gin Glu 
"5 200 205 

Val He Phe Gly Asn Gin Gly Glu Gly Leu Thr Leu Arg His Asp Ser 
2Q 210 215 220 

Tyr Asp Arg He Ser Phe Met Thr Gly Val Asn Leu Gly He Lys Glu 
225 230 235 240 

Val Val Lys Arg His Glu Leu Val Tyr Gly Leu Glu His Leu Leu 
25 2 45 250 255 

<210> 142 
<211> 91 
30 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 142 

Met Ala Asn Lys Gin Asp Leu He Ala. Lys Val Ala Glu Ala Thr Glu 
35 1 5 10 15 

Leu Thr Lys Lys Asp Ser Ala Ala Ala Val Glu Ala Val Phe Ala Ala 
20-25 30 

40 Val Ala Asp Tyr Leu Ala Ala Gly Glu Lys Val Gin Leu He Gly Phe 
35 40 45 

Ser Asn Phe Glu Val Arg -Glu Arg Ala Glu Arg Lys Gly Arg Asn Pro 
50 55 60 

45 

Gin Thr Gly Lys Glu Met Thr lie Ala Ala Ser Lys Val Pro Ala Phe* 
65 70 75 so 

Lys Ala Gly Lys Ala Leu Lys Asp Ala Val Lys 
50 . 85 90 



<210> 143 
<211> 306 
55 <212> PRT 

<213> Streptococcus pneumoniae 
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<400> 143 

Met Thr Lys Thr Ala Phe Leu Phe Ala Gly Gin Gly Ala Gin Tyr Leu 
15 io 15 

5 Gly Met Gly Arg Asp Phe Tyr Asp Gin Tyr Pro lie Val Lys Glu Thr ' 
20 25 30 

lie Asp Arg Ala Ser Gin Val Leu Gly Tyr Asp Leu Arg Tyr Leu lie 
1Q 35 40 45 

Asp Thr Glu Glu Asp Lys Leu Asn Gin Thr Arg Tyr Thr Gin Pro Ala 
50 55 60 

He Leu Ala Thr Ser Val Ala lie Tyr Arg Leu Leu Gin Glu Lys Glv 
15 65 70 75 qq 

Tyr Gin Pro Asp Met Val Ala Gly Leu Ser Leu Gly Glu Tyr Ser Ala 
85 go 95 

20 Leu Val Ala Ser Gly Ala Leu Asp Phe Glu Asp Ala Val Ala Leu Val 
100 105 no 

Ala Lys Arg Gly Ala Tyr Met Glu Glu Ala Ala Pro Ala Asp Ser Glv 
25 "5 120 125 

Lys Met Val Ala Val Leu Asn Thr Pro Val Glu Val He Glu Glu Ala 
130 135 140 

Cys Gin. Lys Ala Ser Glu Leu Gly Val Val Thr Pro Ala Asn Tyr Asn 
30 145 - 150 155 i 6 o 

Thr Pro Ala Gin He Val He Ala Gly Glu Val Val Ala -Val Asp Arg 
165 170 175 

35 Ala Val Glu Leu Leu Gin Glu Ala Gly Ala Lys Arg Leu He Pro Leu 
180 185 190 

Lys Val Ser Gly Pro Phe His Thr Ala Leu Leu Glu Pro Ala Ser Gin 
' 195 200 205 

40 

Lys Leu Ala Glu Thr Leu Ala Gin Val Ser Phe Ser Asp Phe Thr Cys 
210 215 220 

Pro Leu Val Gly Asn Thr Glu Ala Ala Val Met Gin Lys Glu Asp lie 
45 225 230 235 2 40 

Ala Gin Leu Leu Thr Arg Gin Val Lys. Glu Pro Val Arg Phe Tyr Glu 
245 250. 255 

50 Ser He Gly Val Met. Gin Glu Ala Gly He Ser Asn Phe He Glu He 
260 265 270 

Gly Pro Gly Lys Val Leu Ser Gly Phe Val Lys Lys lie Asp Gin Thr 
55 275 280 285 

. Ala His Leu Ala His Val Glu Asp Gin Ala Ser Leu Val Ala Leu Leu 
290 295 300 
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Glu Lys 
305 

5 

<210> 144 
<211> 243 
<212> PRT 

<213> Streptococcus pneumoniae 

10 

<400> 144 

Met Lys Leu- Glu His Lys Asn lie Phe lie Thr Gly Ser Ser Arq Glv 

15 He Gly Leu Ala He Ala His Lys Phe Ala Gin Ala Gly Ala Asn He 
20 25 30 

Val Leu Asn Ser Arg Gly Ala He Ser Glu Glu Leu Leu Ala Glu Phe 

20 

Ser Asn Tyr Gly He Lys Val Val Pro He Ser Gly Asp Val Ser Asp 
50 55 60 

Phe Ala Asp Ala Lys Arg Met He Asp Gin Ala He Ala Glu Leu Gly 
25- 65 70 75 80 

Ser Val Asp Val Leu Val Asn Asn Ala Gly He Thr Gin Asp Thr Leu 
85 90 95 

30 Met Leu Lys Met Thr Glu. Ala Asp Phe Glu Lys Val Leu Lys Val Asn 
100 . . 105 no 

Leu Thr Gly Ala Phe Asn Met Thr Gin Ser Val Leu Lys Pro Met Met 
35 H5 120 125 

Lys Ala Arg Glu Gly Ala He He Asn Met Ser Ser Val Val Gly Leu 
130 135 140 

Met Gly Asn He Gly Gin Ala Asn Tyr Ala Ala Ser Lys Ala Gly Leu 
40 145 150 . 155 i 60 

He Gly Phe Thr Lys Ser Val Ala Arg Glu Val Ala Ser Arg Asn He 
165 170 175 

45 Arg Val Asn Val He Ala Pro Gly Met He Glu Ser Asp Met Thr Ala 
.180 185 190 

He Leu Ser Asp Lys He Lys Glu Ala Thr Leu Ala Gin He Pro Met 
5Q 1^5 200 205 

Lys Glu Phe Gly Gin Ala Glu Gin Val Ala Asp Leu Thr Val Phe Leu 
210 215 220 

Ala Gly Gin Asp Tyr Leu Thr Gly Gin Val Val Ala He Asp Gly Glv 
55 225 230 2 35 240 

Leu Ser Met 
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<210> 145 
5 <211> 276 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 145 

10 Met Gly Val Lys Lys Lys Leu Lys Leu Thr Ser Leu Leu Gly Leu Ser 
1 5 .10 15 

Leu Leu He Met Thr Ala Cys Ala Thr Asn Gly Val Thr Ser Asp He 
20 25 30 

Thr Ala Glu Ser Ala Asp Phe Trp Ser Lys Leu Val Tyr Phe Phe Ala 
35 4t) 45 

Glu He He Arg Phe Leu Ser Phe Asp He- Ser lie Gly Val Gly He 
20 50 55 60 

He Leu Phe Thr Val Leu He Arg Thr Val Leu Leu Pro Val Phe Gin 
65 ™ 75 80 

25 Val Gin Met Val Ala Ser Arg Lys Met Gin Glu Ala Gin Pro Arg He 

85 90 95 



15 



30 



Lys Ala Leu Arg Glu Gin Tyr Pro Gly Arg Asp Met Glu Ser Arg Thr 
100 105 no 

Lys Leu Glu Gin Glu Met Arg Lys Val Phe Lys Glu Met Gly Val Arg 
H5 120 125 

Gin Ser Asp Ser Leu Trp Pro lie Leu lie Gin Met Pro Val He Leu 
130 135 140 

Ala Leu Phe Gin Ala Leu Ser Arg Val Asp Phe Leu Lys Thr Glv His 
145 150 155 leo 

40 Phe Leu Trp lie Asn Leu Gly Ser Val Asp Thr Thr Leu Val Leu Pro 

165 170 



175 



lie Leu Ala Ala Val Phe Thr Phe Leu Ser Thr Trp Leu Ser Asn Lys 
45 I 80 185 190 

Ala Leu Ser Glu Arg Asn Gly Ala Thr Thr Ala Met Met Tyr Gly He 
195 200 205 

Pro Val Leu He Phe lie Phe Ala Val Tyr Ala Pro Gly Gly Val Ala 
50 210 215 220 

Leu Tyr Trp Thr Val Ser Asn Ala Tyr Gin Val Leu Gin Thr Tyr Phe 
225 230 235 240 

55 Leu Asn Asn Pro Phe Lys lie lie Ala Glu Arg Glu Ala Val Val Gin 

245 250 255 
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Ala Gin Lys Asp Leu Glu Asn Arg Lys Arg Lys Ala Lys Lys Lvs Ala 
260 265 270 



Gin Lys Thr Lys 
5 275 



<210> 146 
<211> 409 
10 <212> PRT 

<213> Streptpcoccus pneumoniae 

<400> 146 

Met Lys He Ser Lys Arg His Leu Leu Asn Tyr Ser He Leu He Pro 
15 1 5 10 is 

Tyr Leu Leu Leu Ser He Leu Gly Leu He Val Val Tyr Ser Thr Thr 
20 25 30 

20 Ser Ala He Leu He Glu Glu Gly Lys Ser Ala Leu Gin Leu Val Ara 
35 40 45 

Asn Gin Gly He Phe Trp He Val Ser Leu He Leu He Ala Leu He 

25 

Tyr Lys Leu Arg Leu Asp Phe Leu Arg Asn Glu Arg Leu He lie Leu 
65 70 75 80 

Val He Leu He Glu Met Leu Leu Leu Phe Leu Ala Arg Phe He Gly 
30 ' 85 90 95 

He Ser Val Asn Gly Ala Tyr Gly Trp He Ser Val Ala Gly Val Thr 
100 105 no 

35 He Gin Pro Ala Glu Tyr Leu Lys He He He lie Trp Tyr Leu Ala 
115 120 125 

His Arg Phe Ser Lys Gin Gin Glu Glu lie Ala Thr Tyr Asp Phe Gin 
130 135 140 

40 

Val Leu Thr Gin Asn Gin Trp Leu Pro Arg Ala Phe Asn Asp Trp Ara 
145 150 155 16 o 

Phe Val Leu Leu Val Leu lie Gly Ser Leu Gly lie Phe Pro Asp Leu 
45 165 170 175 

Gly Asn Ala Thr lie Leu Val Leu Val Ser Leu lie Met Tyr Thr Val 
180 185 190 

50 Ser Gly He Ala Tyr Arg Trp Phe Ser Thr He Leu Ala Leu Val Ser 
195 200 205 

Ala Thr Ser Val Phe Val Leu Thr Thr He Ser Leu lie Gly Val Glu 
55 210 215 220 

Thr Phe Ser Lys lie Pro Val Phe Gly Tyr Val Ala Lys Arg Phe Ser 
225 2 30 235 240 
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Ala Phe Phe Asn Pro Phe Ala Asp Arg Ala Asp Ala Gly His Gin Leu 
245 250 255 

5 Ala Asn Ser Tyr Phe Ala Met Val Asn Gly Gly Trp Phe Gly Leu Gly 
260 265 270 

Leu Gly Asn Ser lie Glu Lys Arg Gly Tyr Leu Pro Glu Ala His Thr 

275 o q n rt « „ 

10 



275 280 285 



Asp Phe Val Phe Ser lie Val lie Glu Glu Phe Gly Phe Val Gly Ala 
290 295 300 • 

Ser Leu lie Leu Ala Leu Leu Phe Phe Met lie Leu Arg lie lie Leu 
LD 305 310 -si c 

JAU 315 320 

Val Gly He Arg Ala Glu Asn Pro Phe Asn Ala Met Val Ala Leu Glv 
325 330 335 

20 Val Gly Gly Met Met Leu Val Gin Val Phe Val Asn lie Gly Gly lie 
340 345 35Q 



Ser Gly Leu He Pro Ser Thr Gly Val Thr Phe Pro Phe Leu Ser Gin 

355 *3cn 

25 



355 3 6 o 365 



Gly Gly Asn Ser Leu Leu Val Leu Ser Val Ala Val Ala Phe Val Leu 
370 375 380 

30 ™< ASP Ma 6lU LyS Arg Ala Lys Leu T * r Ar 9 Glu Leu Glu 

JU 385 390 395 40O 

Asn Gin Pro Met Asn Leu Leu Leu Lys 
405 

35 

<210> 147 
<211> 419 
<212> PRT 
^ <213> Streptococcus pneumoniae 

<400> 147 

Met Leu Gly He Leu Thr Phe He Leu Val Phe Gly He He Val Val 
1 ' 5 10 15 

45 Val His" Glu Phe Gly His Phe Tyr Phe Ala Lys Lys Ser Gly He Leu 
20 - 25 30 



Val Arg Glu Phe Ala He Gly Met Gly Pro Lys lie Phe Ala His H e 

50 



35 40 45 . 



Gly Lys Asp Gly Thr Ala Tyr Thr He Arg lie Leu Pro Leu Gly Gly 



55 60 



Tyr Val Arg Met Ala Gly Trp Gly Asp Asp Thr Thr Glu He Lys Thr 
M 65 70 75 80 

Gly Thr Pro Val Ser Leu Thr Leu Ala Asp Asp Gly Lys Val Lys Arg 
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85 



90 



95 



He Asn Leu Ser Gly Lys Lys Leu Asp Gin Thr Ala Leu Pro Met Gin 
100 105 HO 

Val Thr Gin Phe Asp Phe Glu Asp Lys Leu Phe He Lys Gly Leu Val 
115 120 125 * 

Leu Glu Glu Glu Lys Thr Phe Ala Val Asp His Asp Ala Thr Val Val 
130 135 140 

Glu Ala Asp Gly Thr Glu Val Arg He Ala Pro Leu Asp Val Gin Tyr 
145 150 155 160 

Gin Asn Ala Thr He Trp Gly Lys Leu He Thr Asn Phe Ala Gly Pro 
165 170 175 

Met Asn Asn Phe He Leu Gly Val Val Val Phe Trp Val Leu He Phe 
180 185 190 

Met Gin Gly Gly Val Arg Asp Val Asp Thr Asn Gin Phe His He Met 
195 200 205 

Pro Gin Gly Ala Leu Ala Lys Val Gly Val Pro Glu Thr Ala Gin He 
210 215 220 

Thr Lys He Gly Ser His Glu Val Ser Asn Trp Glu Ser Leu He Gin 
225 230 235 240 

Ala Val Glu Thr Glu Thr Lys Asp Lys Thr. Ala Pro Thr Leu Asp Val 
• 245 250 255 

Thr He Ser Glu Lys Gly Ser Asp Lys Gin Val Thr Val Thr Pro Glu 
260 265 270 

Asp Ser Gin Gly Arg Tyr Leu Leu Gly Val Gin Pro Gly Val Lys Ser 
275 280 285 

Asp Phe Leu Ser Met Phe Val Gly Gly Phe Thr Thr Ala Ala Asp Ser 
290 295 300 

Ala Leu Arg He Leu Ser Ala Leu Lys Asn Leu lie Phe Gin Pro Asp 
305 310 315 320 

Leu Asn Lys Leu Gly Gly Pro Val Ala He Phe. Lys Ala Ser Ser Asp 
325 330 335 

Ala Ala Lys Asn Gly He Glu Asn He" Leu Tyr Phe Leu Ala Met He 
340 345 350 

Ser He Asn He Gly He Phe Asn ieu lie Pro He Pro Ala Leu Asp 
355 360 365 

Gly Gly Lys lie Val Leu Asn lie Leu Glu Ala lie Arg Arg Lys Pro 
370 375 380 



Leu Lys Gin Glu lie Glu Thr Tyr Val Thr Leu Ala Gly Val Val lie 
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385 



390 



395 



400 



Met Val Val Leu Met lie Ala Val Thr Trp Asn Asp He Met Arg Leu 
405 410 415 



Phe Phe Arg 



10 <210> 148 
<211> 197 
. <212> PRT 
<213> Streptococcus pneumoniae 

15 <400> 148 

Met Tyr Ala Tyr Leu Lys Gly lie He Thr Lys He Thr Ala Lys Tyr 
1 ■ 5 10 15 

He Val Leu Glu Thr Asn Gly He Gly Tyr He Leu His Val Ala Asn 
20 20 25 30 

Pro Tyr Ala Tyr Ser Gly Gin Val Asn Gin Glu Ala Gin He Tyr Val 
35 40 45 

25 His Gin Val Val Arg Glu Asp 'Ala His Leu Leu Tyr Gly Phe Arq Ser 
5 ° .55 60 

Glu Asp Glu Lys Lys Leu Phe Leu Ser Leu He Ser Val Ser Gly He 

65 7 0 75 fln 

30 80 

Gly Pro Val Ser Ala Leu Ala He He Ala Ala Asp Asp Asn Ala Gly 
85 90 '95 

Leu Val Gin Ala He Glu Thr Lys Asn He Thr Tyr Leu Thr Lys Phe 
35 100 105 no 

Pro Lys He Gly Lys Lys Thr Ala Gin Gin Met Val Leu Asp Leu Glu 
115 120 125 

40 Gly Lys Val Val Val Ala Gly Asp Asp Leu Pro Ala Lys Val Ala Val 
130 135 140 ' 

Gin Ala Ser Ala Glu Asn Gin Glu Leu Glu Glu Ala Met Glu Ala Met 
45 145 150 155 i 6 o 

Leu Ala Leu Gly Tyr Lys Ala Thr Glu Leu Lys Lys lie Lys Lys Phe 
165 ..170 175 

Phe Glu Gly Thr Thr Asp Thr Ala Glu Asn Tyr He. Lys Ser Ala Leu 
^ U 180 . 185 190 

■ Lys Met Leu Val Lys 
195 

55 

<210> 149 
<211> 257 
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<212> PRT 

<213> Streptococcus pneumoniae 
<400> 149 

5 Met Lys Asn Asn Arg lie Leu Ala Leu Ser Gly Asn Asp lie Phe Ser 
1 5 10 15 

Gly Gly Gly Leu Ser Ala Asp Leu Ala Thr Tyr Thr Leu Asn Gly Leu 

10 

His Gly Phe Val Ala Val Thr Cys Leu Thr Ala Leu Thr Glu Lys Gly 
35 40 45 

Phe Glu Val Phe Pro Thr Asp Asp Thr He Phe Gin His Glu Leu Asp 
15 50 55 60 - 

Ser Leu Arg Asp Val Glu Phe Gly Gly He Lys He Gly Leu Leu Pro 
65 70 75 80 

Thr Val Ser Val Ala Glu Lys Ala Leu Asp Phe He Lys Gin Arg Pro 
85 90 95 

Gly Val Pro Val Val Leu Asp Pro Val Leu Val Cys Lys Glu Thr His 
100 105 HO 

Asp Val Ala Val Ser Glu Leu Cys Gin Glu Leu He Arg Phe Phe Pro 
115 120 125 

Tyr Val Ser Val He Thr Pro Asn Leu Pro Glu Ala Glu Leu Leu Ser 
130 135 140 

Gly Gin Glu He Lys Thr Leu Glu Asp Met Lys Thr Ala Ala Gin Lys 
145 150 155 160 

Leu His Asp Leu Gly Ala Pro Ala Val He lie Lys Gly Gly Asn Arg 
165 170 175 

Leu Ser Gin Asp Lys Ala Val Asp Val Phe Tyr Asp Gly Gin Thr Phe 
180 185 190 

Thr He Leu Glu Asn Pro Val He Gin Gly Gin Asn Ala Gly Ala Gly 
195 - 200 205 

Cys Thr Phe Ala Ser Ser He Ala Ser His Leu Val Lys Gly Asp Lys 
210 215 220 

Phe Leu Pro Ala Val Glu Ser Ser Lys-Ala Phe Val Tyr Arg Ala He 
225 230 235 240 

Ala Gin Ala Asp Gin Tyr Gly Val Arg Gin Tyr Glu Ala Asn Lys Asn 
245 250 . 255 

Asn 



<210> 150 
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<211> 412 
<212> PRT 

<213> Streptococcus pneumoniae 



5 <400> 150 

Met lie Glu Thr Glu Lys Lys Glu Glu Arg Val Leu Leu He Gly Val 



10 



15 



Glu Leu Gin Gly Met Asp Ser Phe Asp Leu Ser Met Glu Glu Leu Ala 
20 25 3 0 

Ser Leu Ala Lys Thr Ala Gly Ala Val Val Val Asp Ser Tyr Arg Gin 
35 40 45 

15 Lys Arg Glu Lys Tyr Asp Ser Lys Thr Phe Val Gly Ser Gly Lys Leu 
^ U 55 go 

Glu Glu He Ala Leu Met Val Asp Ala Glu Glu lie Thr Thr Val He 
6i> 70 ?t 

20 ■ . 75 80 

Val Asn Asn Arg Leu Thr Pro Arg Gin Asn Val Asn Leu Glu Glu Val 
85 90 3 5 

25 Gly Val ^ Val 116 Asp Ar 9 Met Gln Leu Leu Asp lie Phe 

100 105 uo . 

Ala Met Arg Ala Arg Ser His Glu Gly Lys Leu Gin Val His Leu Ala 
115 120 125 

30 Gin Phe Lys Tyr Leu Leu Pro Arg Leu Val Gly Gin Gly He Met Leu 
1JU 135 140 

Ser Arg Gin Ala Gly Gly He Gly Ser Arg Gly Pro Gly Glu Ser Gin 

35 150 135 160 

Leu Glu Leu Asn Arg Arg Ser Val Arg Asn Gin He Thr Asp He Glu 
165 170 175 

40 ** g Gln LSU ^ Val Val Glu ^ A ?n Arg Ala Thr Val Arg Glu Lys 
180 185 i 90 y 

Arg Leu Glu Ser Ser Thr Phe Lys He Gly Leu He Gly Tyr Thr Asn 
195 200 205 

45 Ala Gly Lys Ser Thr He Met Asn He Leu Thr Ser Lys Thr Gin Tyr 
^l0 215 



220 



Glu Ala Asp Glu Leu Phe Ala Thr Leu Asp Ala Thr Thr Lys Ser lie 
50 230 235 - 240 

His Leu Gly Gly Asn Leu Gin Val Thr Leu Thr Asp Thr Val Gly Phe 
245 250 255 

^ lie Gin Asp Leu Pro Thr Glu Leu Val Ser Ser Phe Lys'ser Thr Leu 
250 265 270 



Glu Glu Ser Lys His Val Asp Leu Leu. Val His Val He Asp Al 



a Ser 
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27 5 280 285 

Asn Pro Tyr His Glu Glu His Glu Lys Thr Val Leu Ser lie Met Lvs 
29 ° ; 295 300 

Asp Leu Asp Met Glu Asp lie Pro His Leu Thr Leu Tyr Asn Lys Ala 

315 320 



305 310 



Asp Leu Val Glu Asp Phe Thr Pro Thr Gin Thr Pro Tyr Thr Leu II 



10 325 



330 335 



e 



Ser Ala Lys Ser Glu Asp Ser Arg Glu Asn Leu Gin Ala Leu Leu Leu 
340 345 350 

15 Asp Lys He Lys Glu He Phe Glu Ala Phe Thr Leu Arg Val Pro Phe 
355 .360 . 365 

. Ser Lys Ser Tyr Lys He His Asp Leu Glu Ser Val Ala He Leu Glu 
20 37 °' ■ 375 

Glu Arg Asp Tyr Gin Glu Asp Gly. Glu Val He Thr Gly Tyr He Ser 

400 



385 390 395 



Glu Lys Asn Lys Trp Arg Leu Glu Glu Phe Tyr Asp 
25 405 4io 

<210> 151 
<211> 160 
30 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 151 

Met Ala Glu Lys Thr Tyr Pro Met Thr Leu Glu Glu Lys Glu Lys Leu 
1 5 10 15 

Glu Lys Glu Leu Glu Glu Leu Lys Leu Val Arg Arg Pro Glu Val Val 
20 25 30 

40 Glu Arg He Lys He Ala Arg Ser Tyr Gly Asp Leu Ser Glu Asn Ser 
35 40 45 ■ 

Glu Tyr Glu Ala Ala Lys Asp Glu Gin Ala Phe Val Glu Gly Gin He 
45 50 55 60 

Ser Ser Leu Glu Thr Lys He Arg Tyr Ala Glu He Val Asn Ser Asp 
65 70 75 8 o 

Ala Val Ala Gin Asp Glu Val Ala lie Gly Lys Thr Val Thr He Gin 
W 85 90 95 

Glu He Gly Glu Asp Glu Glu Glu Val Tyr He He Val Gly Ser Ala 
100 105 no 

55 Gly Ala Asp Ala Phe Ala Gly Lys Val Ser Asn Glu Ser Pro He Glv 
115 120 125 
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Gin Ala Leu lie Gly Lys Lys Thr Gly Asp Thr Ala Thr lie Glu Thr 
130 135 140 

Pro Val Gly Ser Tyr Asp Val Lys lie Leu Lys Val Glu Lys Thr Ala 
^ 145 150 155 



160 



10 

<210> 152 
<211> 189 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 152 

Met Thr Lys Leu Leu Val Gly Leu Gly Asn Pro Gly Asp Lys Tyr Phe 
1 5 .10 • 15 

20 Glu Thr Lys His Asn Val Gly Phe Met Leu He Asp Gin Leu Ala Lys 
20 25 30 

Lys Gin Asn Val Thr Phe Thr His Asp Lys He Phe Gin Ala Asp Leu 
25 35 40 45 

Ala Ser Phe Phe Leu Asn Gly Glu Lys He Tyr Leu Val Lys Pro Thr 
50 55 60 

Thr Phe Met Asn Glu Ser Gly Lys Ala Val His Ala Leu Leu Thr Tvr 
30 65 70 75 80 

Tyr Gly Leu Asp He Asp Asp Leu Leu He He Tyr Asp Asp Leu Asp 
85 90 95 

35 Met Glu Val Gly Lys He Arg Leu Arg Ala Lys Gly Ser Ala Gly Glv 
100 105 . no 

His Asn Gly He Lys Ser He lie Gin His He Gly Thr Gin Val Phe 
40 115 120 125 

Asn Arg Val Lys He Gly He Gly Arg Pro Lys Asn Gly Met Ser Val 
130 135 140 

Val His His Val Leu Ser Lys Phe Asp Arg Asp Glu Tyr He Gly He 
45 145 150 155 i 6 o 

Leu Gin Ser Val Asp Lys Val Asp Asp. Ser Val Asn Tyr Tyr Leu Gin 
165 . 170 175 

50 Glu Lys Asn Phe Glu Lys Thr Met Gin Arg Tyr Asn Gly 
180 185 



<210> 153 . 
55 <211> 283 
<212> PRT 

<213> Streptococcus pneumoniae 
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10 



20 



25 



35 



40 



<400> 153 

Met He Leu- He Thr Gly Ala Asn Gly Gin Leu Gly Thr Glu Leu Arg 
1 5 10 15 

Tyr Leu Leu Asp Glu Arg Asn Glu Glu Tyr Val Ala Val Asp Val Ala 
20 25 30 

Lys Met Asp He Thr Asn Glu. Glu Met Val Glu Lys Val Phe Glu Glu 
35 40 45 

Val Lys Pro Thr Leu Val 'Tyr His Cys Ala Ala Tyr Thr Ala Val Asd 
50 55 60 



15 Ala Ala Glu Asp Glu Gly Lys Glu Leu Asp Phe Ala lie Asn Val 



65 



70 



75 



Thr 
80 



Gly Thr Lys Asn Val Ala Lys Ala Ser Glu Lys His Gly Ala Thr Leu 
85 90 95 . 

Val Tyr He Ser Thr Asp Tyr Val Phe Asp Gly Lys Lys Pro Val Glv 
100 ms U 0 y 



Gin Glu Trp Glu Val Asp Asp Arg Pro Asp Pro Gin Thr Glu Tyr Gly 
115 120 125 

Arg Thr Lys Arg Met Gly Glu Glu Leu Val Glu Lys His Val Ser Asn 
130 135 140 

30 Phe Tyr He He Arg Thr Ala Trp Val Phe Gly Asn Tyr Gly Lys Asn 
145 150 * 155 



160 



Phe Val Phe Thr Met Gin Asn Leu Ala Lys Thr His Lys Thr Leu Thr 
165 , 170 175 

Val' Val Asn Asp Gin Tyr Gly Arg Pro Thr Trp Thr Arg Thr Leu Ala 
I 80 .185 190 

Glu Phe Met Thr Tyr Leu Ala Glu Asn Arg Lys Glu Phe Gly Tyr Tyr 
195 200 205 



His Leu Ser Asn Asp Ala Thr Glu Asp Thr Thr Trp Tyr Asp Phe Ala 
210 215 220 

45 Val Glu He Leu Lys Asp Thr Asp Val Glu Val Lys' Pro Val Asp Ser 
225 2 30 235 240 



50 



Ser Gin Phe Pro Ala Lys Ala Lys Arg Pro Leu Asn Ser Thr Met Ser 
2 *5 250 255 

Leu Ala Lys -Ala Lys Ala Thr Gly Phe Val lie Pro Thr Trp Gin Asd 
260 265 270 



Ala Leu Gin Glu Phe Tyr Lys Gin Glu Val Arq 
55 .275 280 
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<210> 154 

<2U> 407 

5 <212> PRT 

<213> Streptococcus pneumoniae 



10 



<400> 154 

Met Lys Arg Ser Leu Asp Ser Arg Val Asp Tyr Ser Leu Leu Leu Pro 
15 10 15 

Val Phe Phe Leu Leu Val He Gly Val Val Ala lie Tyr He Ala Val 
20 25 30 

15 Ser His Asp Tyr Pro Asn Asn He Leu Pro He Leu Gly Gin Gin Val 
35 40 45 



20 



Ala Trp He Ala Leu Gly Leu Val lie Gly Phe Val Val Met Leu Phe 

50 55 60 

Asn Thr Glu Phe Leu Trp Lys Val Thr Pro Phe Leu Tyr He Leu Gly 

65 70 75 80 



Leu Gly Leu Met He Leu Pro He Val Phe Tyr Asn Pro Ser Leu Val 
25 85 90 95 

Ala Ser -Thr Gly Ala Lys Asn Trp Val Ser He Asn Gly He Thr Leu 
100 105 no 

30 Phe Gin Pro Ser Glu Phe Met Lys He Ser Tyr He Leu Met Leu Ala 
115 120 125 

Arg Val He Val Gin Phe Thr Lys Lys His Lys Glu Trp Arg Arg Thr 
35 130 135 140 

. Val Pro Leu Asp Phe Leu Leu He Phe Trp Met He Leu Phe Thr He 
145 150 155 160 

Pro Val Leu Val Leu Leu Ala Leu Gin Ser Asp Leu Gly Thr Ala Leu 
40 165 170 175 

Val Phe Val Ala He Phe Ser Gly He Val Leu Leu Ser Gly Val Ser 
180 185 190 

45 Trp Lys lie He He Pro Val Phe Val Thr Ala Val Thr Gly Val Ala 
195 200 205 

Gly Phe Leu Ala He Phe He Ser Lys Asp Gly Arg Ala Phe Leu His 
210 215 220 

50 

Gin He Gly Met Pro Thr Tyr Gin He Asn Arg He Leu Ala Trp Leu 
225 230 235 240 

Ash Pro Phe Glu Phe Ala Gin Thr Thr Thr Tyr Gin Gin Ala Gin Gly 
55 245 250 255 



Gin He Ala lie Gly Ser Gly Gly Leu Phe. Gly Gin Gly Phe Asn Al 



a 
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260 265 270 

Ser Asn Leu Leu lie Pro Val Arg Glu Ser Asp Met He Phe Thr Val 
275 280 285 

He Ala Glu Asp Phe Gly Phe He Gly Ser Val Leu Val He Ala Leu 
250 295 300 

Tyr Leu Met Leu He Tyr Arg Met Leu Lys He Thr Leu Lys Ser Asn 
10 305 310 315 3 2 o 

Asn Gin Phe Tyr Thr Tyr He Ser Thr Gly Leu He- Met Met Leu Leu 
325 330 335 

15 Phe His He Phe Glu Asn lie Gly Ala Val Thr Gly Leu Leu Pro Leu 
340 345 350 

Thr Gly He Pro Leu Pro Phe He Ser Gin Gly Gly Ser Ala He He 
2Q 355 360 365 

Ser Asn Leu He Gly Val Gly Leu Leu Leu Ser Met Ser Tyr Gin Thr 
370 375 380 

Asn Leu Ala Glu Glu Lys Ser Gly Lys Val Pro Phe Lys Arg Lys Lvs 
25 385 390 395 — ~ 



400 



30 



35 



Val Val Leu Lys Gin lie Lys 
405 



<210> 155 
<211> 202 
<212> PRT 

<213> Streptococcus pneumoniae' 



<400> 155 

Met Gly Lys He lie Gly lie Thr Gly Gly He Ala Ser Gly Lys Ser 
1 5 10 15 

40 Thr Val Thr Asn Phe Leu Lys His Gin Gly Leu Ser Ser Ser Gly Leu 
20 25 30 



45 



Pro Thr. Gin Cys Ser Thr Asn Tyr Arg Lys Pro Gly Gly Arg Leu Phe 

. 35 40 45 

Glu Ala Leu VaJ. Gin His Phe Gly Gin Glu lie lie Leu Glu Asn Glv 

50 55 ... 60 



Glu Leu Asn Arg Pro Leu lie Ala Ser Leu lie Phe Ser Asn Pro Glu 
50 65 70 75 80 

Glu Gin Lys Trp Ser Asn Gin lie Gin Gly Glu lie lie Arg Glu Glu 
. 85 . 90 95 

55 Leu Ala Thr Leu Arg Glu Gin Leu Ala Gin Thr Glu Glu lie Phe Phe 
100 105 HO 
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Met Asp lie Pro Leu Leu Phe Glu Gin Asp Tyr Ser Asp Trp Phe Ala 
115 !20 125 

Glu* Thr Trp Leu Val Tyr Val Asp Arg Asp Ala Gin Val Glu Arq Leu 
5 130 135 140 ' 

Met Lys Arg Asp Gin Leu Ser Lys Asp Glu Ala Glu" Ser Arg Met Ala 
145 150 155 i 60 

10 Ala Gin Trp Pro Leu Glu Lys Lys Lys Asp Leu Ala Ser Gin Val Leu 

165 170 175 



15 



Asp Asn Asn Gly Asn Gin Asn Gin Leu Leu Asn Gin Val His lie Leu 
180 185 190 

Leu Glu Gly Gly Arg Gin Asp Asp Arg Asp 
195 200 



20 <210> 156 
<211> 419 
<212> PRT 

<213> Streptococcus pneumoniae 
25 <400> 156 

Met Arg Lys lie Val He Asn Gly Gly Leu Pro Leu Gin Gly Glu He 
1 5 io 15 . 

Thr He Ser Gly Ala Lys Asn Ser Val Val Ala Leu He Pro Ala He 
30 20 25 30 

He Leu Ala Asp Asp Val Val Thr Leu Asp Cys Val Pro Asp He Ser 
35 40 45 

35 Asp Val Ala Ser Leu Val Glu He Met Glu Leu Met Gly Ala Thr Val 
50 55 60 

Lys Arg Tyr Asp Asp Val Leu Glu He Asp Pro Arg Gly* Val Gin Asn 

40 - 65 70 75 80 

He Pro Met Pro Tyr Gly Lys lie Asn Ser Leu Arg Ala Ser Tyr Tyr 
85 90 95 

Phe Tyr Gly Ser Leu Leu Gly Arg Phe Gly Glu Ala Thr Val Gly Leu 
45 100 105 110 

Pro Gly Gly Cys Asp Leu Gly Pro Arg Pro He Asp Leu His Leu Lys 
115 120 125 

50 Ala Phe Glu Ala Met Gly Ala Thr Ala Ser Tyr Glu Gly Asp Asn Met 
130 135 140 



55 



Lys Leu Ser Ala Lys Asp Thr Gly Leu His Gly Ala Ser He Tyr Met 

145 150 155 * 160 

Asp Thr Val Ser Val Gly Ala Thr He Asn Thr Met lie Ala Ala Val 
165 170 i 75 
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Lys Ala Asn Gly Arg Thr He lie Glu Asn Ala Ala Arg Glu Pro Glu 
180 185 19Q 



5 lie He Asp Val Ala Thr Leu Leu Asn Asn Met Gly Ala His lie Arg 
195 200 205 

Gly Ala Gly Thr Asn lie lie lie He Asp Gly Val Glu Arg Leu His 



10 



Gly Thr Arg His Gin Val He Pro Asp Arg He Glu Ala Gly Thr Tyr 
230 235 240 



lie Ser Leu Ala Ala Ala Val Gly Lys Gly lie Arg lie Asn Asn 
lJ 245 



250 



Val 



255 



Leu Tyr Glu His Leu Glu Gly Phe He Ala Lys Leu Glu Glu Met Gly 
260 265 270 

20 Val Arg Met Thr Val Ser Glu Asp Ser He Phe Val Glu Glu Gin Ser 
275 280 285 

Asn Leu Lys Ala He Asn He Lys Thr Ala Pro Tyr Pro Gly Phe Ala 
25 290 295 300 

Thr Asp Leu Gin Gin Pro Leu Thr Pro Leu Leu Leu Arg Ala Asn Gly 
310 315 320 

Arg Gly Thr He Val Asp Thr He Tyr Glu Lys Arg Val Asn His Val 
325 33 0 

Phe Glu Leu Ala Lys Met Asp Ala Asp He Ser Thr Thr Asn Gly His 
J4 ° 345 350 

35 He Leu Tyr Thr Gly Gly Arg Asp Leu Arg Gly Ala Ser Val Lys Ala 
•"^ 360 365 

Thr Asp Leu Arg Ala Gly Ala Ala Leu Val He Ala Gly Leu Met Ala 
J/u 375 380 

Glu Gly Lys Thr Glu lie Thr Asn He Glu Phe He Leu Arg Gly Tyr 



40 



45 



50 



55 



Ser Asp lie He Glu Lys Leu Arg Asn Leu Gly Ala Asp He Arg Leu 

Val Glu Asp 



405 410 41 g 



<210> 157 
<211> 231 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 157 

Met Ser Arg He Glu Phe Ser Pro Ser . Leu Met Thr Met Asp Leu Asp 
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1 5 10 15 

Lys Phe Lys Glu Gin lie Thr The Leu Asn Asp Lys Val Ala Ser Tvr 
5 20 25 30 

His He Asp He Met Asp Gly His Phe Val Pro Asn He Thr Leu Ser 
35 4.0 45 

Pro Trp Phe He Gin Glu Val Gin Lys He Ser Asp Thr Pro Leu Ser 
10 50 ; 55 60 

Val His Leu Met Val Thr Asp Pro Thr Phe Trp Val Asp Gin Val Leu 
65 70 75 eo 

15 Asp Leu Gin Cys Glu Tyr He Cys He His Ala Glu Val Leu Asn Gly 

85 90 95 



20 



35 



Leu Ala Phe Arg Leu He Asp Lys He His Asp Ala Gly Leu Lys Ala 
100 105 no 

Gly Val Val Leu Asn Pro Glu Thr Pro Val Ser Thr He Phe Pro Tvr 
H5 120 125 



He Asp Leu Leu Asp Lys Val Thr He Met Thr Val Asp Pro Gly Phe 
23 130 135 140 

Ala Gly Gin Arg Phe Leu Glu Ser Thr Leu Tyr Lys He Gin Glu Leu 
145 150 155 160 

30 Arg Gin Leu Arg Val Gin Asn Gly Tyr His Tyr He He Glu Met Asp 

165 170 175 



Gly Ser Ser Ser Arg Lys Thr Phe Lys Gin He Asp Val Ala Gly Pro 
180 185 190 

Asp He Tyr Val lie Gly Arg Ser Gly Leu Phe Gly Leu Asp Asp Asd 
195 200 205 



lie Ala Lys Ala Trp Asp He Cys Ser Arg Asp Tyr Glu Glu Met Thr 
4U 210 215 220 

Gly Lys Thr Met Pro He Lys 
225 230 

45 

<2X0> 158 
<211> 374 
<212> PRT 

<213> Streptococcus pneumoniae 

50 

<400> 158 

Met Arg Asn Met Ala Leu Thr Ala Gly He Val Gly Leu Pro Asn Val 
1 5 io "is 

55 Gly Lys Ser Thr Leu Phe Asn Ala He Thr Lys Ala Gly Ala Glu Ala 
20 . . 25 30 
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Ala Asn Tyr Pro Phe Ala Thr lie Asp Pro Asn Val Gly Met Val Glu 
35 40 45 

Asp Pro Asp Glu Arg Leu Gin Lys Leu Thr Glu Met He Thr Pro Lys 
5 50 55 60 

Lys Thr Val Pro Thr Thr Phe Glu Phe Thr Asp He Ala Gly He Val 
65 70 75 bo 

10 Lys Gly Ala Ser Lys Gly Glu Gly Leu Gly Asn Lys Phe Leu Ala Asn 

85 90 95 



15 



He Arg Glu Val Asp Ala He Val His Val Val Arg Ala Phe Asp Asp 
100 105 no 

Glu Asn Val Met Arg Glu Gin Gly Arg Glu Asp Ala Phe Val Asp Pro 
115 120 125 



Leu Ala Asp He Asp Thr He Asn' Leu Glu Leu He Leu Ala Asp Leu 
20. 130 135 140 

Glu Ser Val Asn Lys Arg Tyr Ala Arg Val Glu Lys Met Ala Arg Thr 
14 5 150 155 160 

25 Gin Lys Asp Lys Glu Ser Val Ala Glu Phe Asn Val Leu Gin Lys He 

165 170 175 



30 



Lys Pro Val Leu Glu Asp Gly Lys Ser Ala Arg Thr He Glu Phe Thr 
180 185 190 

Asp Glu Glu Gin Lys Val Val Lys Gly Leu Phe Leu Leu Thr Thr Lys 
195 200 205 



Pro Val Leu Tyr Val Ala Asn Val Asp Glu Asp Val Val Ser Glu Pro 
35 210 215 220 

Asp Ser He Asp Tyr Val Lys Gin He Arg Glu Phe Ala Ala Thr Glu 
225 230 . 235 240 

40 Asn Ala Glu Val Val Val lie Ser Ala Arg Ala Glu Glu Glu He Ser 

245 250 255 



45 



Glu Leu Asp Asp Glu Asp Lys Lys Glu Phe Leu Glu Ala He Gly Leu 

260 265 270 

Thr Glu Ser Gly Val Asp Lys Leu Thr Arg Ala Ala Tyr His Leu Leu 
275 280 . 285 



Gly Leu Gly Thr Tyr Phe Thr Ala Gly Glu Lys Glu Val Arg Ala Trp 
50 290 295 300 



Thr Phe Lys Arg Gly Met Lys Ala Pro Gin Ala Ala Gly He He His 

3 °5 310 315 320 

55 Ser Asp Phe Glu Lys Gly Phe He Arg Ala Val Thr Met Ser Tyr Glu 

325 330 335 
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Asp Leu Val Lys Tyr Gly Ser Glu Lys Ala Val Lys Glu Ala Gly Arq 
340 345 350 * 

Leu Arg Glu Glu Gly Lys Glu Tyr He Val Gin Asp Gly ' Asp He Met 
^ 355 360 365 



10 



15 



Glu Phe Arg Phe Asn Val 
370 



<210> 159 
<211> 110 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 159 

Met- Glu He Glu Lys Thr Asn Arg Met Asn Ala Leu Phe Glu Phe Tyr 
1 5 io 



15 



20 Ala Ala Leu Leu Thr Asp Lys Gin Met Asn Tyr He Glu Leu Tvr Tvr 
20 25 30 

Ala Asp Asp Tyr Ser Leu Ala Glu lie Ala Glu Glu Phe Gly Val Ser 
25 35 40 45 

Arg Gin Ala Val Tyr Asp Asn He Lys Arg Thr Glu Lys He Leu Glu 
50 55 60 

Asp Tyr Glu Met Lys Leu His Met Tyr Ser Asp Tyr He Val Arq Ser 
30 65 70 75 go 

Gin He Phe Asp Gin He Leu Glu Arg Tyr Pro Lys Asp Asp Phe Leu 
85 90 95 

35 Gin Glu Gin He Glu He Leu Thr Ser lie Asp Asn Arg Glu 
100 105 no 

<210> 160 
40 <211> 223 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 160 

45 Met Thr Leu Glu Trp Glu Glu Phe Leu Asp Pro Tyr lie Gin Ala Val 
1 5 in 



15 



Gly Glu Leu Lys lie Lys Leu Arg Gly lie Arg Lys Gin Tyr Arg Lvs 
20 25 * ™ 

50 30 

Gin Asn Lys His Ser Pro lie Glu Phe Val Thr Gly Arg Val Lys Pro 
35 40 45 

lie Glu Ser lie Lys Glu Lys Met Ala Arg Arg Gly lie Thr Tvr Ala 
M 50 55 60 

Thr Leu Glu His Asp Leu Gin Asp He Ala Gly Leu Arg Val Met Val 



100 
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65 70 75 



80 



Gin Phe Val Asp Asp Val Lys Glu Val Val Asp lie Leu His Lys Arg 
85 go 95 

Gin Asp Met Arg lie lie Gin Glu Arg Asp Tyr lie Thr His Arg Lys 
100 105 110 

Ala Ser Gly Tyr Arg Ser Tyr His Val Val Val Glu Tyr Thr Val Asp 
115 120 125 

Thr He Asn Gly Ala Lys Thr He Leu Ala Glu He Gin He Arg Thr 
130 135 140 

Leu Ala Met Asn Phe Trp Ala Thr He Glu His Ser Leu Asn Tyr Lys 
145 150 155 160 

Tyr Gin Gly Asp Phe Pro Asp Glu He Lys Lys Arg Leu Glu He Thr 
165. 170 175 

Ala Arg He Ala His Gin Leu Asp Glu Glu Met Gly Glu He Arg Asp 
180 185 190 

Asp He Gin Glu Ala Gin Ala Leu Phe Asp Pro Leu Ser Arg Lys Leu 
195 200 205 

Asn Asp Gly Val Gly Asn Ser Asp Asp Thr Asp Glu Glu Tyr Arg 
210 215 220 



<210> 161 
<211> 195 
<212> PRT- 

<213> Streptococcus pneumoniae 
<400> 161 

Met Glu Leu Asn Thr His Asn Ala Glu He Leu Leu Ser Ala Ala Asn 
15 10 15 

Lys Ser His Tyr Pro Gin Asp Glu Leu Pro Glu He Ala Leu Ala Gly 
20 25 30 

Arg Ser Asn Val Gly Lys Ser Ser Phe lie Asn Thr Met Leu Asn Arg 
35 40 45 

Lys Asn Leu Ala Arg Thr Ser Gly Lys Pro Gly Lys Thr Gin Leu Leu 

50 55 ... . 60 

Asn Phe Phe Asn He Asp Asp Lys Met Arg Phe Val Asp Val Pro Gly 
65 70 75 80 

Tyr Gly Tyr Ala Arg Val Ser Lys Lys Glu Arg Glu Lys Trp Gly Cys 
85 90 95 

Met He Glu Glu Tyr Leu Thr Thr Arg Glu Asn Leu Arg Ala Val Val 
. 100 105 HO 
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Ser Leu Val Asp Leu Arg His Asp Pro Ser Ala Asp Asp Val Gin Met 
115 120 125 

Tyr Glu Phe Leu Lys Tyr Tyr Glu lie Pro Val lie lie Val Ala Thr 
5 130 135 140 

Lys Ala Asp Lys He Pro Arg Gly Lys Trp Asn Lys His Glu Ser Ala 
145 150 155 160 

.10 He Lys Lys Lys Leu Asn Phe Asp Pro Ser Asp Asp Phe He Leu Phe 

165 170 175 

Ser Ser Val Ser Lys Ala Gly Met Asp Glu Ala Trp Asp Ala He Leu 
180 185 190 

Glu Lys Leu 
195 



20 <210> 162 
<211> 97 
<212> PRT 

<213> Streptococcus pneumoniae 
25 <400> 162 

Met Lys Thr Arg Lys lie Pro Leu Arg Lys Ser Val Val Ser Asn Glu 
15 10 15 

Val He Asp Lys Arg Asp Leu Leu Arg He Val Lys Asn Lys Glu Gly 
30 20 25 30 

Gin Val Phe He Asp Pro Thr Gly Lys Ala Asn Gly Arg Gly Ala Tyr 
35 40 45 

35 He Lys Leu Asp Asn Ala Glu' Ala Leu Glu Ala Lys Lys Lys Lys Val 
50 55 60 

Phe Asn Arg Ser Phe Ser Met Glu Val Glu Glu Ser Phe Tyr Asp Glu 

40 65 70 75 80 

Leu He Ala Tyr Val Asp His Lys Val Lys Arg Arg Glu Leu Gly Leu 
85 90 95 

Glu 

45 



<210> 163 
<211> 103 
50 <212> PRT 

<213> Streptococcus pneumoniae 

<400£M63 

- Met Leu Lys Pro Ser He Asp Thr Leu Leu Asp Lys Val Pro Ser Lys 
55 1 5 io 15 

Tyr Ser Leu Val He Leu Glu Ala Lys Arg Ala His Glu Leu Glu Ala 
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20 25- 



30 



Gly Ma Pro Ala Thr Gin Gly Phe Lys Ser Glu Lys Ser Thr Leu Arg 
5 35 40 45 

Ala Leu Glu Glu lie Glu Ser Gly Asn Val Thr lie His Pro Asd Pro 
50 55 60 

in G « LyS Arg G1U Ala Val Arg Ar 9 Ar 9 Ile Glu G1 " Glu Lys Arg 

10 65 ™ 75 go 

Arg Lys Glu Glu Glu Glu Lys Lys Ile Lys Glu Gin lie Ala Lys Glu 
85 90 95 

15 Lys Glu Asp Gly Glu Lys Ile 
100 

<210> 164 
20 <211> 103 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 164 

25 Met Ser Leu Thr Ser Lys Gin Arg Ala Phe Leu Asn Ser Gin Ala His 
1 5 io 15 

Thr Leu Lys Pro lie Ile Gin Ile Gly Lys Asn Gly Leu Asn Asp Gin 
30 . 20 25 30 

Ile Lys Thr Ser Val Arg Gin Ala Leu Asp Ala Arg Glu Leu Ile Lys 
35 40 45 

_ Val Thr Leu Leu Gin Asn Thr Asp Glu Asn Ile His Glu Val Ala Glu 
- 0 50 55 . 60 

Ile Leu Glu Glu Glu Ile Gly Val Asp Thr Val Gin Lys lie Gly Arg 
65 70 75 eo 

40 He Leu Ile Leu Phe Lys Gin Ser Ser Lys Lys Glu Asn Arg Lys Ile 

85 90 95 

Ser Lys Lys Val Lys Glu Ile 
100 

45 

<210> 165 
<211> 175 
<212> PRT 
50 <213> Streptococcus pneumoniae 

<400> 165 

Met Ala Ile Glu Asn Tyr Ile Pro Asp Phe Ala Val Glu Ala Val Tyr 
55 1 • 5 .10 15 

Asp Leu Thr Val Pro Ser Leu Gin Ala Gin Gly Ile Lys Ala Val Leu 
20 25 30 
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Val Asp Leu Asp Asn Thr Leu He Ala Trp Asn Asn Pro Asp Glv Thr 
35 40 45 

5 Pro Glu Met Lys Gin Trp Leu His Asp Leu Arg Asp Ala Glv He Glv 
50 55 J y 



60 



lie lie Val Val Ser Asn Asn Thr Lys Lys Arg Val GlnArg Ala Val 

10 70 75 80 

Glu Lys Phe Gly He Asp Tyr Val Tyr Trp Ala Leu Lys Pro Phe Thr 
85 90 . 95 

Phe Gly lie Asp Arg Ala Met Lys Glu Phe His Tyr Asp Lys Lys Glu 
15 100 105 110 

Val Val Met Val Gly Asp Gin Leu Met Thr Asp lie Arg Ala Ala His 
115 ' 120 125 

20 Arg Ala Gly He Arg Ser He Leu Val Lys Pro Leu Val Gin His Asd 
130 135 i 4 o 

Ser He Lys Thr Gin He Asn Arg Thr Arg Glu Arg Arg Val Met Ara 
25 145 150 155 16 J 

Lys He Thr Glu Lys Tyr Gly Pro He Thr Tyr Lys Lys Gly He 
165 170 175 

30 <210> 166 
<211> 455 
<212> PRT 

<213> Streptococcus pneumoniae 
35 <400> 166 

Met Phe Arg Lys He Leu He Ala Asn Arg Gly Glu He Ala Val Arq 
1 5 io 15 

He He Arg Ala Ala Arg Glu Leu Gly He Ala Thr Val Ala Val Tvr 
W 20 25 30 

Ser Thr Ala Asp Lys Glu Ala Leu His Thr Leu Leu Ala Asp Glu Ala 
• 35 40 45 

45 Val Cys He Gly Pro Gly Lys Ala Thr Glu Ser Tyr Leu Asn He Asn 
50 55" 60 

Ala Val Leu Ser Ala Ala Val Leu Thr Glu Ala Glu Ala He His Pro 
50 " 70 75 B0 

Gly' Phe Gly Phe Leu Ser Glu Asn Ser Lys Phe Ala Thr Met Cys Glu 
85 90 95 

Glu He Gly He Lys Phe He Gly Pro Ser Gly His Val Met Asp Met 
1Q 0 105 no 

Met Gly Asp Lys He Asn Ala Arg Ala Gin Met He Lys Ala Gly Val 



104 
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115 120 125 

Pro Val lie Pro Gly Ser Asp Gly Glu Val His Asn Ser Glu Glu Ala 
5 130 135 140. 

Leu lie Val Ala Glu Lys He Gly Tyr Pro Val Met Leu Lys Ala Ser 
145 150 155 160 

Ala Gly Gly Gly Gly Lys Gly lie Arg Lys Val Glu Lys Pro Asp Asp 
1U 165 170 175 

Leu Val Ser Ala Phe Glu Thr Ala Ser Ser Glu Ala Lys Ala Asn Tvr 
180 185 190 

15 Gly Asn Gly Ala Met Tyr He Glu Arg Val He Tyr Pro Ala Arg His 
195 200 205 

He Glu Val Gin He Leu Gly Asp Glu His Gly His Val He His Leu ' 
210 215 220 

Gly Glu Arg Asp Cys Ser Leu Gin Arg Asn Asn Gin Lys Val Leu Glu 
225 230 • 235 • 240 

Glu Ser Pro' Ser He Ala He Gly Lys Thr Leu Arg His Glu He Glv 
245 250 255 

Ala Ala Ala Val Arg Ala Ala Glu Phe Val Gly Tyr Glu Asn Ala Glv 
260 265 270 

30 Thr He Glu Phe Leu Leu Asp Glu Ala Ser Ser Asn Phe Tyr Phe Met 
275 280 . 285 



20 



25 



35 



40 



Glu Met Asn Thr Arg Val Gin Val Glu His Pro Val Thr Glu Phe Val 
290 295 300 

Ser Gly Val Asp He Val Lys Glu Gin He Cys He Ala Ala Glv Gin 
305 31 ° 315 320 

Pro Leu Ser Val Lys Gin Glu Asp He Val Leu Arg Gly His Ala He 
325 330 335 

Glu Cys Arg He Asn Ala Glu Asn Pro Ala Phe Asn Phe Ala Pro Ser 
340 345 350 

■ 45 Pro Gly Lys He Thr Asn Leu Tyr Leu Pro Ser Gly Gly Val Gly Leu 
355 360 365 

Arg Val Asp Ser Ala Val Tyr Pro Gly Tyr Thr He Pro Pro Tyr Tyr 
50 375 380 

Asp Ser Met He Ala Lys He He Val His Gly Glu Asn Arg Phe Asp 
385 390 395 400 

Ala Leu Met Lys Met Gin Arg Ala Leu Tyr Glu Leu Glu He Glu Glv 
" 405 410 415 

Val - Gln Thr Asn Ala Asp Phe Gin Leu Asp Leu He Ser Asp Arg Asn 
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420 425 



430 



Val He Ala Gly Asp Tyr Asp Thr Cys Phe Leu Met Glu Thr Phe Leu 
5 435 440 445 

Pro Lys Tyr Gin Glu Lys Glu 
450 455 

10 <210> 167 
<211> 77 
<212> PRT 

<213> Streptococcus pneumoniae 
15 <400> 167 

Met He Tyr Lys Val Phe Tyr Gin Glu Thr Lys Glu Arg Ser Pro Arg 
1 5 10 15 

Arg Glu Thr Thr Arg Ala Leu Tyr Leu Asp He Asp Thr Ser Ser Glu 
20 20 25 30 

Leu Glu Gly Airg He Thr Ala Arg Gin Leu Val Glu Glu Asn Arg Pro 
35 40 45 

25 Glu Tyr Asn He Glu Tyr He Glu Leu Leu Ser Asp Lys Leu Leu Asd 
5 ° 55 60 

Tyr Glu Lys Glu Thr Gly Ala Phe Glu He Thr Glu Phe 
65 70 75 

30 

<210> 168 
<211> 336 
<212> PRT 
35 <213> Streptococcus pneumoniae 

<400> 168 

Met Lys Asp Arg Tyr lie Leu Ala Phe Glu Thr Ser Cys Asp Glu Thr 

40 1 ' 5 - 10 

Ser Val Ala Val Leu Lys Asn Asp Asp Glu Leu Leu Ser Asn Val He 
20 25 30 

Ala Ser Gin He Glu Ser His Lys Arg Phe Gly Gly Val Val Pro Glu 
45 35 40 45 

Val Ala Ser Arg His His Val Glu Val He Thr Ala Cys He Glu Glu 

50 55 ■-; 60 

50 Ala Leu Ala Glu Ala Gly He Thr Glu Glu Asp Val Thr Ala Val Ala 
65 70 75 80 

Val Thr Tyr Gly Pro Gly Leu Val Gly Ala Leu Leu Val Gly Leu Ser 
55 .85 90 95 

Ala Ala Lys Ala Phe Ala Trp Ala His Gly Leu Pro Leu He Pro Val 
100 105 no 
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Asn His Met Ala Gly His Leu Met Ala Ala Gin Ser Val Glu Pro Leu 
115 120 125 

5 Glu Phe Pro Leu Leu Ala Leu Leu Val Ser Gly Gly His Thr Glu Leu 
130 135 140 



10 



Val Tyr Val Ser Glu Ala Gly Asp Tyr Lys lie Val Gly Glu Thr Arq 

145 .150 155 , i 6 o 

Asp Asp Ala Val Gly Glu Ala Tyr Asp Lys Val Gly Arg Val Met Gly 
165 • 170 175 



Leu Thr Tyr Pro Ala Gly Arg Glu He Asp Glu Leu Ala His Gin Gly 
15 180 185 190 • 

Gin Asp He Tyr Asp Phe Pro Arg Ala Met He Lys Glu Asp Asn Leu 
195 200 205 

20 Glu Phe Ser Phe Ser Gly Leu Lys Ser Ala Phe He Asn Leu His His 
210 215 220 

Asn Ala Glu Gin Lys Gly Glu Ser Leu Ser Thr Glu Asp Leu Cys Ala 
25 225 230 . 235 240 

Ser Phe Gin Ala Ala Val Met Asp He Leu Met Ala Lys Thr Lys Lys 
245 250 255 

Ala Leu Glu Glu Tyr Pro Val Lys Thr Leu Phe Val Ala Gly Gly Val 
30 260 265 270 

Ala Ala Asn Lys Gly Leu Arg Glu Arg Leu Ala Ala Glu He Thr Asp 
275 280 285 

35 Val Lys Val He He Pro Pro Leu Arg Leu Cys Gly Asp Asn Ala Gly 
■290 295 300 

Met He Ala Tyr Ala Ser Val Ser Glu Trp Asn Lys Glu Asn Phe Ala 
4Q 305 310 315 320 

Gly Trp Asp Leu Asn. Ala Lys Pro Ser Leu Ala Phe Asp Thr Met Glu 
325 330 335 

45 



<210> 169 

<211> 602 . . 

50 <212> PRT - ■ 

<213> Streptococcus pneumoniae 

<400> 169 

Met Cys Gly He Val Gly Val Val Gly Asn Thr Asn Ala Thr Asp He 
55 < 1 5 . 10 15 

Leu He Gin Gly Leu Glu Lys Leu Glu Tyr Arg Gly Tyr Asp Ser Ala 
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20 25 3 0 

Gly lie Phe Val Leu Asp Gly Ala Asp Asn His Leu Val Lys Ala Val 



40 



45 



Gly Arg lie Ala Glu Leu Ser Ala Lys Thr Ala Gly Val Glu Gly Thr 



50 55 60 



10 65 



Thr Gly lie Gly His Thr Arg Trp Ala Thr His Gly Lys Pro Thr Glu 



70 



75 



80 



Asp Asn Ala His Pro His Arg Ser Glu Thr Glu Arg Phe Val Leu Val 
. 85 90 95 

15 His Asn Gly Val lie Glu Asn Tyr Leu Glu He Lys Glu Glu Tyr Leu 
100 105 • 110 



20 



Ala Gly His His Phe Lys Gly Gin Thr Asp Thr Glu He Ala Val His 
115 120 125 

Leu He Gly Lys Phe Ala Glu Glu Glu Gly Leu Ser Val Leu Glu Ala 
130 135 140 



25 145 



Ph? Lys Lys Ala Leu His He He Arg Gly Ser Tyr Ala Phe Ala Leu 



150 



155 160 
He Asp Ser Glu Asn Pro Asp Val He Tyr Val Ala Lys Asn Lys Ser 



165 



170 



175 



30 Pro Leu Leu He Gly Leu Gly Glu Gly Tyr Asn Met Val Cys Ser Asd 
180 * 



185 



190 



Ala Met Ala Met He Arg Glu Thr Asn Gin Tyr Met Glu He His Asp 



35 



200 



205 



Gin Glu Leu Val He Val Lys Ala Asp Ser Val Glu Val Gin Asp Tyr 



210 215 220 



dn oo? Gly Asn Ser Arg Glu Ala Ser T ^ r Thr Ala Glu Leu Asp Leu 
U 225 230 235 240. 



Ser Asp He Gly Lys Gly Thr Tyr Pro' Tyr Tyr Met Leu Lys Glu He 

245 250 255 

45 Asp Glu Gin Pro Thr Val Met Arg Lys Leu He Gin Ala Tyr Thr Asp 
260 265 270 



50 



Asp Ala Gly Gin Val Val Val Ala Pro Ala He He Lys Ala Val Gin 
275 280 285 

Asp Ala Asp Arg He Tyr He Leu Ala Ala Gly Thr Ser Tyr His Ala 
290 295 300 



55 305 



Gly Phe Ala Ser Lys Lys Met Leu Glu Glu Leu Thr Asp Thr Pro Val 



310 315 . 320 

Glu Leu Gly He Ser Ser Glu Trp Gly Tyr Gly Met Pro Leu Leu Ser 
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325 330 335 

Lys Lys Pro Leu Phe He Phe He Ser Gin Ser Gly Glu Thr Ala Asp 
5 340 345 350 

Ser Arg Gin Val Leu Val Lys Ala Asn Glu Met Gly He Pro Ser Leu 
355 360 365 ' 

Thr Val Thr Asn Val Pro Gly Ser Thr Leu Ser Arg Glu Ala Asn Tvr 
10 370 375 380 

Thr Met Leu Leu His Ala Gly Pro Glu He Ala Val Ala Ser Thr Lys 
385 390 395 4 00 

15 Ala Tyr Thr Ala Gin He Ala Ala Leu Ala Phe Leu Ala Lys Ala Val 

405 410 415 

Gly Glu Ala Asn Gly Asn Ala Lys Ala Gin Ala Phe Asp Leu Val His 
2Q 420 425 430 

Glu Leu Ser He Val Ala Gin Ser He Glu Ser Thr Leu Ser Glu Lys 
435 440 445 

Glu Thr He Glu Ala Lys Val Arg Glu Leu Leu Glu Thr Thr Arg Asn 
25 450 455 460 

Ala Phe Tyr He Gly Arg Gly Gin Asp Tyr Tyr Val Ala Met Glu Ala 
465 470 475 48 0 

30 Ser Leu Lys Leu Lys Glu He Ser Tyr He Gin Cys Glu Gly Phe Ala 

485 490 495 

Ala Gly Glu Leu Lys His Gly Thr He Ala Leu He Glu Glu Gly Thr 
35 500 505 510 

Pro Val Leu Ala Leu Leu Ser Asp Pro Val Leu Ala Asn His Thr Arg , 
515 520 525 

Gly Asn He Gin Glu Val Ala Ala Arg Gly Ala Lys Val Leu Thr lie ' 
40 530 535 540 

Ala Glu Glu Asn Val Ala Lys Asp Thr Asp Asp He Val Leu Thr Thr 
545 550 ' 555 5 60 

45 Val His Pro Tyr Leu Ser Pro He Ser Met Val Val Pro Thr Gin Leu 

565 570 • 575 

Val Ala Tyr Phe Ala Thr Leu His Arg Gly Leu Asp Val Asp Lys Pro 
5Q 580 58.5 590 

Arg Asn Leu Ala Lys Ser Val Thr Val Glu 
595 600 

55 <210> 170 
<211> 240 
<212> PRT 
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<213> Streptococcus pneumoniae 
<400> 170 

s Met lie Arg lie Glu Asn Leu Ser Val Ser Tyr Lys Glu Thr Leu Ala 

10 15 



Leu Lys Asp lie Ser Leu Val Leu His Gly Pro Thr lie Thr Gly lls 

10 lie Gly Pro Asn Gly Ala Gly Lys Ser Thr Leu Leu Lys Gly Met Leu 
J5 40 45 

Gly lie He Pro His Gin Gly Gin Ala Phe Leu Asp Asp Lys Glu Val 



15 



55 60 



Lys Lys Ser Leu His Arg lie Ala Tyr Val Glu Gin Lys He Asn lie 

70 75 



80 

Asp Tyr Asn Phe Pro lie Lys Val Lys Glu Cys Val Ser Leu Gly Leu 
85 90 95 



Phe Pro Ser lie Pro Leu Phe Arg Ser Leu Lys Ala Lys His Tr P Lys 

105 110 

25 Lys Val Gin Glu Ala Leu Glu lie Val Gly Leu Ala Asp Tyr Ala Glu 



125 



Arg Gin He Ser Gin Leu Ser Gly Gly Gin Phe- Gin Arg Val Leu He 
30 135 140 

Ala Arg Cys Leu Val Gin Glu Ala Asp Tyr He Leu Leu Asp Glu Pro 

X0D 160 



35 



Phe Ala Gly He Asp Ser Val Ser Glu Glu lie lie Met Asn Thr Leu 
165 170 



175 



Arg Asp Leu Lys Lys Ala Gly Lys Thr Val Leu lie Val His His Asp 
1WU 185 



190 



40 Leu Ser Lys He Pro His Tyr Phe Asp Gin Val Leu Leu Val Asn Arg 
° 200 205 



Glu Val lie Ala Phe Gly Pro Thr Lys Glu Thr Phe Thr Glu Thr Asn 



45 220 

Leu Lys Glu Ala Tyr Gly Asn Gin Leu Phe Phe Asn Gly Gly Asp 

230 .. 235 

50 



Leu 
240 



<210> 171 
<211> 740 
55 <212> PRT 

<213> Streptococcus pneumoniae 
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<400> 171 

Met Pro Lys Glu Val Asn Leu Thr Gly Glu Glu Val Val Ala Leu Thr 
15 10 15 

5 Lys Glu Tyr Leu Thr Glu Glu Asp Val His Phe Val His Lys Ala Leu 
20 25 * Q 



10 



Val Tyr Ala Val Glu Cys His Ser Gly Gin Tyr Arg Lys Ser Gly Glu 
35 40 45 

Pro Tyr He lie His Pro lie Gin Val Ala Gly He Leu Ala Lys Leu 
50 "55 60 - 

15 L 65 Ma ™J Ma CyS GlY PhS LeU His As P Val Val 



80 



Glu Asp Thr Asp Ala Thr Leu Asp Asp Leu Glu Arg Glu Phe Gly Pro 
85 90 95 

20 Asp Val Arg Val lie Val Asp Gly Val Thr Lys Leu Gly Lys Val Glu 
100 105 110 

Tyr Lys Ser He Glu Glu Gin Leu Ala Glu Asn His Arg Lys Met Leu 
25 115 120 I 25 

Met Ala Met Ser Glu Asp. He Arg Val lie- Leu Val Lys Leu Ser Asp 
1JU 135 140 

30 M6t ^ Thr L6U LyS His Leu Ar 9 Asp Gin 

3 150 155 i 6 o 

Glu Arg He Ser Lys Glu Thr Met Glu He Tyr Ala Pro Leu Ala His 
165 170 175 

35 Arg Leu Gly He Ser Ser Val Lys Trp Glu Leu Glu Asp Leu Ser Phe 
180 i 8 5 190 

Arg Tyr Leu Asn Pro Thr Glu Phe Tyr Lys He Thr His Met Met Lys 
40 i95 200 205 7 

Glu Lys Arg Arg Glu Arg Glu Ala Leu Val Asp Glu Val Val Thr Lvs 
21° 215 220 

45 w G1U ^ ^ oo r G1U Arg His LeU ^ He Tyr Gly 

225 230 235 240 

Arg Pro Lys His He Tyr Ser He Phe -Arg Lys Met Gin Asp Lys Arg 
245 250 255 

50 Lys Arg Phe Glu Glu He Tyr Asp Leu He Ala lie Arg Cys He Leu 
260 265 270 

Asp Thr "Gin Ser Asp Val Tyr Ala Met Leu Gly Tyr Val His Glu Phe 
55 275 280 285 

Trp Lys Pro Met Pro Gly Arg Phe Lys Asp Tyr lie Ala Asn Arg Lys 
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10 



15 



25 



30 



Ala Asn Gly Tyr Gin Ser He His Thr Thr Val Tyr Gly Pro Lys Glv 
305 310 315 



320 



5 Pro He Glu Phe Gin II 



e Arg Thr Lys Glu Met His Glu Val Ala Glu 
325 330 335 

Tyr Gly Val Ala Ala His Trp Ala Tyr Lys Lys Gly He Lys Gly Gin 
340 345 350 

Val Asn Ser Lys Glu Ser Ala He Gly Met Asn Trp He Lys Glu Net 
355 360 365 

Met Glu Leu Gin Asp Gin Ala Asp Asp Ala Lys Glu Phe Val Asp Ser 
37 ° 375 380 

Val Lys Glu Asn Tyr Leu Ala Glu Glu He Tyr Val Phe Thr Pro Asp 
385 ° nn — — 



390 



395 



400 



20 Gly Ala . Val Arg Ser Leu Pro Lys Asp Ser Gly Pro He Asp Phe Ala 

405 4io 415 

Tyr Glu He His Thr Lys Val Gly Glu Lys Ala Thr Gly Ala Lys Val 
420 425 430 

Asn Gly Arg Met Val Pro Leu Thr Thr Lys Leu Lys Thr Gly Asp Gin 



435 



440 



445 



Val Glu He lie Ala Asn Pro Asn Ser Phe Gly Pro Ser Arg Asp Tro 
450 455 460 



Leu Asn Met Val Lys Thr Ser Lys Ala Arg Asn Lys He Arg Gin Phe 
465 4? 0 475 480 

35 Phe Lys Asn Gin Asp Lys Glu Leu Ser Val Asn Lys Gly Arg Glu Met 

485 490 495 



40 



45 



Leu Met Ala Gin Phe Gin Glu Asn Gly Tyr Val Ala Asn Lys Phe Met 
500 505 510 

Asp Lys Arg His Met Asp Gin Val Leu Gin Lys Thr Ser Tyr Lys Thr 
515 520 525 

Glu Asp Ser Leu Phe Ala Ala lie Gly Phe Gly Glu He Gly Ala He 



530 



535 



540 



Thr Val Phe Asn Arg Leu Thr Glu Lys Glu Arg Arg Glu Glu Glu Ara 
545 550 555 5 60 

50 Ala Lys Ala Lys Ala Glu Ala Glu Glu Leu Val Lys Gly Gly Glu Val 

565 570 575 



55 



Lys Val Glu Asn Lys Glu Thr Leu Lys Val Lys His Glu Gly Gly Val 
580 585 590 

Val lie Glu Gly Ala Ser Gly Leu Leu Val Arg lie Ala Lys Cys Cys 
595 600 605 
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Asn Pro Val Pro Gly Asp Asp lie Val Gly Tyr lie Thr Lys Gly Arq 
610 615 620 

5 Gly Val Ala lie His Arg Val Asp Cys Met Asn Leu Arg Ala Gin Glu 
625 630 635 640 

Asn Tyr Glu Gin Arg Leu Leu Asp Val Glu Trp Glu Asp Gin Tyr Ser 
10 645 65 ° 655 

Ser Ser Asn Lys Glu Tyr Leu Ala His lie Asp lie Tyr Gly Leu Asn 
660 665 670 

Arg Thr Gly Leu Leu Asn Asp Val Leu Gin Val Leu Ser Asn Thr Thr 
15 675 680 685 

Lys Asn He Ser Thr Val Asn Ala Gin Pro Thr Lys Asp Met Lys Phe 
690 695 700 

20 Ala Asn lie His Val Ser Phe Gly He Ala Asn Leu Ser Thr Leu Thr 
705 710 715 720 

Thr Val Val Asp Lys He Lys Ser Val Pro Glu Val Tyr Ser Val Lys 

25 

Arg Thr Asn Gly 
740 



725 730 735 



30 <210> 172 
<211> 492 
<212> PRT 

<213> Streptococcus pneumoniae 
35 <400> 172 

Met Ser Asn Trp Asp Thr Lys Phe Leu Lys Lys Gly Phe Thr Phe Asp 
1 5 10 15 

Asp Val Leu Leu lie Pro Ala Glu Ser His Val Leu Pro Asn Asp Ala 
W 20 25 30 

Asp Leu Thr Thr Lys Leu Ala Asp Asn Leu Thr Leu Asn He Pro He 
35 40 45 

45 He Thr Ala Ala Met Asp Thr Val Thr Glu Ser Gin Met Ala He Ala 
50 55 60 

lie Ala Arg Ala Gly Gly Leu Gly Val He His Lys Asn Met Ser He 
50 " 70 75 80 

Ala Gin Gin Ala Asp Glu Val- Arg Lys Val Lys Arg Ser Glu Asn Glv 
85 90 95 y 

Val He He Asp Pro Phe Phe Leu Thr Pro Glu His Thr He Ala Glu 
33 100 105 no 

Ala Asp Glu Leu Met Gly Arg Tyr Arg He Ser Gly Val Pro Val Val 
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115 



120 



125 



Glu Thr Leu Glu Asn Arg Lys Leu Val Gly lie Leu Thr Asn Arq Asp 
5 130 • 135 140 

Leu Arg Phe He Ser Asp Tyr Asn Gin Pro He Ser Asn His Met Thr 
145 150 155 i 6 o 

Ser Glu Asn Leu Val Thr Ala Pro Val Gly Thr Asp Leu Ala Thr Ala 
1U 165 170 i7 5 

Glu Ser He Leu Gin Glu His Arg He .Glu Lys Leu Pro Leu Val Asp 
180 185 190 

15 Glu Glu Gly Ser Leu Ser Gly Leu He Thr He Lys Asp He Glu Lvs 
195 200 9nc; 



205 

Val He Glu Phe Pro Asn Ala Ala Lys Asp Glu Phe Gly Arg Leu Leu 
210 215 - 220 

Val Ala Gly Ala Val Gly Val Thr Ser Asp Thr Phe Glu Arg Ala Glu 
225 23 0 235 240 

Ala Leu Phe Glu Ala Gly Ala Asp Ala He Val He Asp Thr Ala His 
245 250 255 

Gly His Ser Ala Gly Val Leu Arg Lys He Ala Glu He Arg Ala His 
260 265 270 

30 Phe Pro Asp Arg Thr Leu lie Ala Gly Asn He Ala Thr Ala Glu Gly 
275 280 285 



20 



25 



35 



Ala Arg Ala Leu Tyr Glu Ala Gly Val Asp Val Val Lys Val Gly He 
290 295 300 

305 - « 116 Ala Gly Val Gly Val 



310 



315 



320 



Pro Gin Val Thr Ala lie Tyr Asp Ala Ala Ala Val Ala Arg Glu Tyr 
40 325 330 3 35 

Gly Lys Thr He He Ala Asp Gly Gly He Lys Tyr Ser Gly Asp He 
340 345 350 

45 Val Lys Ala Leu Ala Ala Gly Gly Asn Ala Val Met Leu Gly Ser Met 
355 360 365 



50 



55 



Phe Ala Gly Thr Asp Glu Ala Pro Gly Glu Thr Glu He Phe Gin Glv 
370 375 380 

Arg Lys Phe Lys Thr Tyr Arg Gly Met Gly Ser lie Ala Ala Met Lvs 
385 390 395, 400 

Lys Gly Ser Ser Asp Arg Tyr Phe Gin Gly Ser Val Asn Glu Ala Asn 
405 410 415 

Lys Leu Val Pro Glu Gly He Glu Gly Arg Val Ala Tyr Lys Gly Ala 
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420 • 425 



430 



5 



15 



20 



30 



Ala Ala Asp He Val Phe Gin Met He Gly Gly lie Arg Ser Gly Met 
435 440 • 445 

Gly Tyr Cys Gly Ala Ala Asn Leu Lys Glu Leu His Asp Asn Ala Gin 
450 455 460 



Phe He Glu Met Ser Gly Ala Gly Leu Lys Glu Ser His Pro His Asp 
1U 465 470 475 „on 



475 

Val Gin He Thr Asn Glu Ala Pro Asn Tyr Ser Met * 
485 490 



480 



<210> 173 . . 
<211> 648 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 173 

Met Thr Glu Glu He Lys Asn Leu Gin Ala Gin Asp Tyr Asp Ala Ser 
1 5 10 15 

25 Gin He Gin Val Leu Glu Gly Leu Glu Ala Val Arg Met Arg Pro Glv 
20 25 30 



Met Tyr He Gly Ser Thr Ser Lys Glu Gly Leu His His Leu Val Trp 
35 40 45 

Glu He Val Asp Asn Ser He Asp Glu Ala Leu Ala Gly Phe Ala Ser 
50 55 60 



His He Gin Val Phe He Glu Pro Asp Asp Ser He Thr Val Val Asp 
35 65 7 0 75 so 

Asp Gly Arg Gly lie Pro Val Asp He Gin Glu Lys Thr Gly Arg Pro 
85 90 95 

40 Ala Val Glu Thr Val Phe Thr Val Leu His Ala Gly Gly Lys Phe Glv 
100 105 no 

Gly Gly Gly Tyr Lys Val Ser Gly Gly Leu His Gly Val Gly Ser Ser 
45 115 120 125 

Val Val Asn Ala Leu Ser Thr Gin Leu Asp Val His Val His Lys Asn 
130 135 140 

Gly Lys He His Tyr Gin Glu Tyr Arg Arg Gly His Val Val Ala Asp 
50 145 150 155 . leo 

Leu Glu lie Val Gly Asp Thr Asp Lys Thr Gly Thr Thr Val His Phe 
165 170 175 

55 Thr Pro Asp Pro Lys He Phe Thr Glu Thr Thr He Phe Asp Phe Asp 
180 185 190 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Lys Leu Asn Lys Arg He Gin Glu Leu Ala Phe Leu Asn Arg Gly Leu 
195 ■ 200 205 

Gin He Ser He Thr Asp Lys Arg Gin Gly Leu Glu Gin Thr Lys His 
210 215 220 

Tyr His Tyr Glu Gly Gly He Ala Ser Tyr Val Glu Tyr He Asn Glu 
225 230 235 240 

Asn Lys Asp Val He Phe Asp Thr Pro lie Tyr Thr Asp Gly Glu Met 
245 250 255 

Asp Asp lie Thr Val Glu Val Ala Met Gin Tyr Thr Thr Gly Tyr His 
260 265 270 

Glu Asn Val Met Ser Phe Ala Asn Asn He His Thr His Glu Gly Gly 
275 280 285 

Thr His Glu Gin Gly Phe Arg Thr Ala Leu Thr Arg Val He Asn Asp 
290 295 300 

Tyr Ala Arg Lys Asn Lys Leu Leu Lys Asp Asn Glu Asp Asn Leu Thr 
305 310 315 ' 320 

Gly Glu Asp Val Arg Glu Gly Leu Thr Ala Val He Ser Val Lys His 
325 330 335 

Pro Asn Pro Gin Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly Asn Ser 
340 345 350 

Glu Val Val Lys He Thr Asn Arg Leu Phe Ser Glu Ala Phe Ser Asp 
355 360 • 365 

Phe Leu Met Glu Asn Pro Gin lie Ala Lys Arg lie Val Glu Lys Glv 
370 375 380 

He. Leu Ala Ala Lys Ala Arg Val Ala Ala Lys Arg Ala Arg Glu Val 
385 * 390 395 400 

Thr Arg Lys Lys Ser Gly Leu Glu He Ser Asn Leu Pro Gly Lys Leu 
405 410 415 

Ala Asp Cys Ser Ser Asn Asn Pro Ala Glu Thr Glu Leu Phe lie Val 
420 425 430 

Glu Gly Asp Ser Ala Gly Gly Ser Ala Lys Ser Gly Arg Asn Arg Glu 
435 440 - 445 

Phe Gin Ala lie Leu Pro lie Arg Gly Lys lie Leu Asn Val Glu Lys 
4 50 455 4 60 . 

Ala Ser Met Asp Lys lie Leu Ala Asn Glu Glu lie Arg Ser Leu Phe 
465 470 475 480 

Thr Ala Met Gly Thr Gly Phe Gly Ala Glu Phe Asp Val Ser Lys Ala 
485 490 495 
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Arg Tyr Gin Lys Leu Val Leu Met Thr Asp Ala Asp Val Asp Gly Ala 
500 505 510 

His lie Arg -Thr Leu Leu Leu Thr Leu He Tyr Arg Tyr Met Lys Pro 
5 515 520 525 

He Leu Glu Ala Gly Tyr Val Tyr He Ala Gin Pro Pro He Tyr Glv 
530 535 54 0 

10 Val Lys Val Gly Ser Glu He Lys Glu Tyr He Gin Pro Gly Ala Asp 
545 550 555 56 £ 

Gin Glu He Lys Leu Gin Glu Ala Leu Ala Arg Tyr Ser Glu Gly Arg 
565 570 575 

Thr Lys Pro Thr He Gin Arg Tyr Lys Gly Leu Gly Glu Met Asp Asp 
580 585 590 

His Gin Leu Trp Glu Thr Thr Met Asp Pro Glu His Arg Leu Met Ala 
20 595 600 605 

Arg Val Ser Val Asp Asp Ala Ala Glu Ala Asp Lys lie Phe Asp Met 
610 . 615 620 

25 Leu Met Gly Asp Arg Val Glu Pro Arg Arg Glu Phe He Glu Glu Asn 
625 630 635 640 

Ala Val Tyr Ser Thr Leu Asp Val 
645 

30 

<210> 174 
<211> 88 
<212> PRT 
35 <213> Streptococcus pneumoniae 

<400> 174 

Met Gly Phe Thr Glu Glu Thr Val Arg Phe Lys Leu Asp Asp Ser Asn 

40 1 5 10 

Lys Lys Glu He Ser Glu Thr Leu Thr Asp Val Tyr Ala Ser Leu Asn 
20 25 30 

Asp Lys Gly Tyr Asn Pro He Asn Gin He Val Gly Tyr Val Leu Ser 
45 -35 40 45 

Gly Asp Pro Ala Tyr Val Pro Arg Tyr Asn Asn Ala Arg Asn Gin He 
50 55 60 

50 Arg Lys Tyr Glu Arg Asp Glu He Val Glu Glu Leu Val Arg Tyr Tyr 
65 70 75 80 

Leu Lys Gly Gin Gly Val Asp Leu 
85 

55 

<210> 175 
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<211> 198 
<212> PRT 

<213> Streptococcus pneumoniae 
5 <400> 175 

Met Val Asn Tyr Pro His Lys Val Ser Ser Gin Asp Arg Gin Thr Ser 
1 5 10 15 

Leu Ser Gin Pro Lys Asn Phe Ala Asn Arg Gly Met Ser Phe Glu Lvs 
10 20 25 30 

Met He Asn Ala Thr Asn Asp Tyr Tyr Leu Ser Gin Gly Leu Ala Val 
35 40 45 

15 He His Lys Lys Pro Thr Pro He Gin He Val Gin Val Asp Tyr Pro 
50 55 .60 

Gin Arg Ser Arg Ala Lys He Val Glu Ala Tyr Phe Arg Gin Ala Ser 

20 " 

Thr Thr Asp Tyr Ser Gly Val Tyr Asn Gly Tyr Tyr He Asp Phe Glu 
85 90 95 

Val Lys Glu Thr Lys Gin Lys Arg Ala He Pro Met Lys Asn Phe His 
& 100 105 no 

Pro His Gin He Gin His Met Glu Gin Val Leu Ala Gin Gin Gly He 
115 120 125 

30 Cys Phe Val Leu Leu His Phe Ser Ser Gin Gin Glu Thr Tyr Leu Leu 
130 135 140 



35 



Pro Ala Phe Asp Leu He Arg Phe Tyr His Gin Asp Lys Gly Gin Lvs 
145 150 155 uo 

Ser Met Pro Leu Glu Tyr He Arg Glu Tyr Gly Tyr Glu He Lys Ala 
165 170- 175 



Gly Ala Phe Pro Gin He Pro Tyr Leu Asn Val He Lys Glu His Leu 
40 180 185 190 

Leu Gly Gly Lys Thr Arg 
195 

45 

<210> 176 
<211> 288 
<212> PRT 
50 < ^13 > Streptococcus pneumoniae 

<400> 176 

Met Ala Leu Phe Ser Lys Lys Asp Lys Tyr He Arg He Asn Pro Asn 
1 5 10 15 

55 Arg Ser Val Arg Glu Lys Pro Gin Ala Lys Pro Glu Val Pro Asp Glu 
20 25 30 
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Leu Phe Ser Gin Cys Pro Gly Cys Lys His Thr He Tyr Gin Lys Asp 
35 40 45 

Leu Gly Ser Glu Arg lie Cys Pro His Cys Ser Tyr Thr Phe Arq He 
■> 50 55 60 

Ser Ala Gin Glu Arg Leu Ala Leu Thr He Asp Met Gly Thr Phe Lys 
65 70 75 80 

10 Glu Leu Phe Thr Gly He Glu Ser Lys Asp Pro Leu His Phe Pro Gly 

85 90 95 

Tyr Gin Lys Lys Leu Ala Ser Met Arg Glu Lys Thr Gly Leu His Glu 
I5 100 105 no . 

Ala Val Val Thr Gly Thr Ala Leu He Lys Gly Gin Thr Val Ala Leu 
115 120 125 

Gly He Met Asp Ser Asn Phe He Met Ala Ser Met Gly Thr Val Val 
M 130 135 140 

Gly Glu Lys He Thr Arg Leu Phe Glu Tyr Ala Thr Val Glu Lys Leu 
145 15 <> 155 Y 160 

'25 Pro Val Val Leu Phe Thr Ala Ser Gly Gly Ala Arg Met Gin Glu Gly 

165 170 175 

He Met Ser Leu Met Gin Met Ala Lys He Ser Ala Ala Val Lys Ara 
30 180 185 190 

- His Ser Asn Ala Gly Leu Phe Tyr Leu Thr He Leu Thr Asp Pro Thr 
195 200 205 

Thr Gly Gly Val Thr Ala Ser Phe Ala Met Glu Gly Asp He He Leu 
210 215 220 

Ala Glu Pro Gin Ser Leu Val Gly Phe Ala Gly Arg Arg Val He Glu 
225 230 235 240 

40 Asn Thr Val Arg Glu Ser Leu Pro Glu Asp Phe Gin Lys Ala Glu Phe 

245 250 255 - 

Leu Leu Glu His Gly Phe Val Asp Ala He Val Lys Arg Arg Asp Leu 
45 260 265 270 

Pro Asp Thr He Ala Ser Leu Val Arg Leu His Gly Gly Ser Pro Ara 
275 280 285 

50 



<210> 177 
<211> 139 
55 <212> PRT 

<213> Streptococcus pneumoniae 
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<400> 177 



Met Arg He Met Gly Leu Asp Val Gly Ser Lys Thr Val Gly Val Ala 

1 5 10 15 

5 He Ser Asp Pro Leu Gly Phe Thr Ala "Gin Gly Leu Glu He He Gin 

20 25 30 



10 



25 



He Asn Glu Glu Gin Gly Gin Phe Gly Ser Asp Arg Val Lys Glu Leu 

35 40 45 ' 

Val Asp Thr Tyr Lys Val Glu Arg Phe Val Val Gly' Leu Pro Lys Asn 
50 55 60 



_ Met Asn Asn Thr Ser Gly Pro Arg Val Glu Ala Ser Gin Ala Tyr Glv 
^ 65 70 75 



80 



Ala Lys Leu Glu Glu Phe Phe Gly Leu Pro Val Asp Tyr Gin Asp Glu 
85 90 95 

20 Arg Leu Thr Thr Val Ala Ala Glu Arg Met Leu lie Glu Gin Ala Asp 
100 105 no 



He Ser Arg Asn Lys Arg Lys Lys Val He Asp Lys Leu Ala Ala Gin 
115. 120 ( 125 

Leu He Leu Gin Asn Tyr Leu Asp Arg Lys Phe 
130 135 



30 <210> 178 
- <211> 398 
<212> PRT 

<213> Streptococcus pneumoniae 
35 <400> 178 

Met Ala Lys Leu Thr Val Lys Asp Val Asp Leu Lys Gly Lys Lys Val 
1 5 10 15 

■ Leu Val Arg Val Asp Phe Asn Val Pro Leu Lys Asp Gly Val He Thr 
40 20 25 30 

Asn Asp Asn Arg He Thr Ala Ala Leu Pro Thr He Lys Tyr He He 
35 40 45 

45 Glu Gin Gly Gly Arg Ala lie Leu Phe Ser His Leu Gly Arg Val Lys 
50 55 60 



Glu Glu Ala Asp Lys Ala Gly Lys Ser" Leu Ala Pro Val Ala Ala Asd 
50 " 70 75 8? 



Leu Ala Ala Lys Leu Gly Gin Asp Val Val Phe Pro Gly Val Thr Arq 
.85 '90 95 

Gly Ala. Glu Leu Glu Ala Ala lie Asn Ala Leu Glu Asp Gly Gin Val 

^ ioo 105 no 

Leu Leu Val Glu Asn Thr Arg Tyr Glu Asp Val Asp Gly Lys Lys Glu 
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115 



120 



125 



10 



15 



20 



25 



30 



35 



40 



45 



50 



Ser Lys Asn Asp Pro Glu Leu Gly Lys Tyr Trp Ala Ser Leu Gly Asp 
130 135 140 

Gly lie Phe Val Asn Asp Ala Phe Gly Thr Ala His Arg Ala His Ala 
145 150 155 160 

Ser Asn Val Gly lie Ser Ala Asn Val Glu Lys Ala Val Ala Gly Phe 
165 170 175 

Leu Leu Glu Asn Glu lie Ala Tyr lie Gin Glu Ala Val Glu Thr Pro 
180 185 190 

Glu Arg Pro Phe Val Ala He Leu Gly Gly Ser Lys Val Ser Asp Lys 
195 200 205 

lie Gly Val lie Glu Asn Leu -Leu Glu Lys Ala Asp Lys Val Leu He 
210 215 220 

Gly Gly Gly Met Thr Tyr Thr Phe Tyr Lys Ala Gin Gly lie Glu He 
225 230 235 240 

Gly Asn Ser Leu Val Glu Glu Asp Lys Leu Asp Val Ala Lys Ala Leu 
245 ,250 - 255 

Leu Glu Lys Ala Asn Gly Lys Leu lie Leu Pro Val Asp Ser Lys Glu 
260 265 270 

Ala Asn Ala Phe Ala Gly Tyr Thr Glu Val Arg Asp Thr Glu Gly Glu 
275 280 285 

Ala Val Ser Glu Gly Phe Leu Gly Leu Asp He Gly Pro Lys Ser He 
290 295 300 

Ala Lys Phe Asp Glu Ala Leu Thr Gly Ala Lys Thr Val Val Trp Asn 
305 3 1° 315 320 

Gly Pro Met Gly Val Phe Glu Asn Pro Asp Phe Gin Ala Gly Thr He 
325 330 335 

Gly Val Met Asp Ala He Val Lys Gin Pro Gly Val Lys Ser He He 
340 345 350 

Gly Gly Gly Asp Ser Ala Ala Ala Ala He Asn Leu Gly Arg Ala Asp 
355 360 365 

Lys Phe Ser Trp He Ser Thr Gly Gly Gly Ala Ser Met Glu Leu Leu 
37 0 375 380 

Glu Gly Lys Val Leu Pro Gin Leu Ala Ala Leu Thr Glu Lys 
385 390 395 



55 <210> 179 
<211> 165 
<212> PRT 
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<213> Streptococcus pneumoniae 
<400> 179 

Met Leu Lys Ser Glu Lys Gin Ser Arg Tyr Gin Met Leu Asn Glu Glu 
1 5 10 15 

Leu Ser Phe Leu Leu Glu Gly Glu Thr Asn Val Leu Ala Asn Leu Ser 
20 25 30 

10 Asn Ala Ser Ala Leu He Lys Ser Arg Phe Pro Asn Thr Val Phe Ala 
35 40 45 

Gly Phe Tyr Leu Phe Asp Gly Lys Glu Leu Val Leu Gly Pro Phe Gin 
15 55 60 

Gly Gly Val Ser Cys lie Arg lie Ala Leu Gly Lys Gly Val Cys Gly 
65 70 7s a i 



80 



Glu Ala Ala His Phe Gin Glu Thr Val lie Val Gly Asp Val Thr Thr 
85 90 g5 

Tyr Leu Asn Tyr He Ser Cys Asp Ser Leu Ala Lys Ser Glu He Val 
100 105 110 

25 Val Pro Met Met Lys Asn Gly Gin Leu Leu Gly Val Leu Asp Leu Asp 
115 120 125 * P 

Ser Ser Glu lie Glu Asp Tyr Asp Ala Met Asp Arg Asp Tyr Leu Glu 
30 XJU I 35 140 

Gin Phe Val Ala He Leu Leu Glu Lys Thr Ala Trp Asp Phe Thr Met 
45 , 150 155 160 

Phe Glu Glu Lys Ser ' 
35 165 



<210> 180 • 

<211> 209 ■• 
40 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 180 

Met Thr He Glu Leu Leu Thr Pro Phe Thr Lys Val Glu Leu Glu Pro 
1 5 10 15 

Glu He Lys Glu Lys Lys Arg Lys Gin .Val Gly He Leu Gly Gly Asn 
20 25 30 

50 Phe Asn Pro Val His Asn Ala His Leu He Val Ala Asp Gin Val Ara 
35- 40 45 a 

Gin Gin Leu Gly Leu Asp Gin Val Leu Leu Met Pro Glu Tyr Gin Pro 

55 



50 55 60 



Pro His Val Asp Lys Lys Glu Thr He Pro Glu His His Arg Leu Lys 
65 70 75 £ 0 



122 



WO 01/49721 



PCT/USOO/35604 



Met Leu Glu Leu Ala He Glu Gly He Asp Gly Leu Val He Glu Thr 
85 90 95 

5 He Glu Leu Glu Arg Lys Gly He Ser Tyr Thr Tyr Asp Thr Met Lvs 
100 105 no 

He Leu Thr Glu Lys Asn Pro Asp Thr Asp Tyr Tyr Phe He He Glv 
10 115 120 125 

Ala Asp Met Val Asp Tyr Leu Pro Lys Trp Tyr Arg lie Asp Glu Leu 
130 135 . • 140 

Val Asp Met Val Gin Phe Val Gly Val Gin Arg Pro Arg Tyr Lvs Val 
15 145 150 155 " i 6 o 

Gly Thr Ser Tyr Pro Val He Trp Val Asp Val Pro Leu Met Asp He 
165 170 175 

20 Ser Ser Ser Met Val Arg Ala Phe Leu Ala Gin Gly Arg Lys Pro Asn 
180 185 190 

Phe Leu Leu Pro Gin Pro Val Leu Asp Tyr lie Glu Lys Glu Gly Leu 
25 195 200 205 

Tyr 



30 <210> 181 
<211> 255 
<212> PRT 

<213> Streptococcus pneumoniae 
35 <400> 181 

Met Asn He Ala Lys. He Val Arg Glu Ala Arg Glu Gin Ser Arg Leu 
1 5 io 15 

Thr Thr Leu Asp Phe Ala Thr Gly He Phe Asp Glu Phe He Gin Leu 
40 20 25 30 

His Gly Asp Arg Ser Phe Arg Asp Asp' Gly Ala Val Val Gly Gly He 
35 40 45 

45 Gly Trp Leu Gly Asp. Gin Ala Val Thr Val Val Gly He Gin Lys Glv 
50 55 60 

Lys Ser Leu Gin Asp Asn Leu Lys Arg Asn Phe Gly Gin Pro His Pro 
50 65 70 75 so 

Glu Gly Tyr Arg Lys Ala Leu Arg Leu Met Lys Gin Ala Glu Lys Phe 
85 90 95 

Gly Arg Pro Val Val Thr Phe He Asn Thr Ala Gly Ala Tyr Pro Glv 
55 100 105 iio 

Val Gly Ala Glu Glu Arg Gly Gin Gly Glu Ala He Ala Arg Asn Leu 
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115 ™ 125 

Met Glu Met Ser Asp Leu Lys Val Pro lie lie Ala lie n e He Gly 

5 135 140 

Glu Gly Gly Ser Gly Gly Ala Leu Ala Leu Ala Val Ala Asp Arg Val 
. 155 160 



10 



Trp Met Leu Glu Asn Ser lie Tyr Ala lie Leu Ser Pro Glu Gly Phe 
165 170 



175 



Ala Ser lie Leu Trp Lys Asp Gly Thr Arg Ala Met Glu Ala Ala Glu 

lot) 



185 



190 



15 Leu Met Lys lie Thr Ser Ms Glu Leu Leu Glu Met Asp Val Val Asp 



200 205 
Lys Val lie Ser Glu Val Gly Leu Ser Ser Lys Glu Leu lie Lys Ser 
20 215 220 

Val Lys Lys Glu Leu Gin Thr Glu Leu Ala Arg Leu Ser Gin Lys Pro 
3 230 235 



240 



Leu Glu Glu Leu Leu Glu Glu Arg Tyr Gin Arg Phe Arg Lys Tyr 
245 250 255 

<210> 182 
<211> 169 
30 <212> PRT 

<213> Streptococcus pneumoniae 



35 



<400> 182 

Met He Ile Lys Val Glu Met Ala Asp Val Glu Val Leu Ala Lys lie 

. 5 10 15 

Ala Lys Gin Thr Phe Arg Glu Thr Phe Ala Tyr Asp Asn Thr Glu 



20 25 



Glu 
30 



40 Gin Leu Gin Glu Tyr Phe Glu Glu Ala Tyr Ser Leu Lys Thr Leu Ser 
35 40 45 

Thr Glu Leu Gly Asn Pro Asp Ser Glu Thr Tyr Phe He Met His Glu 
ou 55 60 

Glu Glu lie Ala Gly Phe Leu Lys Val Asn Trp Gly Ser Ala Gin Thr 



45 



65 70 ... . 75 



80 



5q Glu Arg Glu Leu Glu Asp Ala Phe Glu Ile Gin Arg Leu Tyr Val Leu 

85 90 95- 

Gin Lys Phe Gin Gly Phe Gly Leu Gly Lys Gin Leu Phe Glu Phe Ala 
100 105 110 

55 



Leu Glu Leu Ala Thr Lys Asn Ser Phe Ser Trp Ala Trp Leu Gly Val 

-no 120 



125 
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Trp Glu His Asn Thr Lys Ala Gin Ala Phe Tyr Asn Arg Tyr Gly Phe 
130 135 140 



Glu Lys Phe Ser Gin His His Phe Met Val Gly Gin Lys Val Asd Thr 
5 14 5 150 155 



160 



Asp Trp Leu Leu Arg Lys Lys' Leu Arg 
165 

10 

<210> 183 
<211> 529 
<212> PRT 
15 <213> Streptococcus pneumoniae 

<400> 183 

Met Leu Arg Gly Thr Ala Leu Leu Thr Ala Ser Asn Phe He Ser Ara 
1 5 10 15 

20 Leu Leu Gly Ala Val Tyr He He Pro Trp Tyr He Trp Met Gly Ala 
20 25 30 

Tyr Ala Ala Lys Ala Asn Gly Leu Phe Thr Met Gly Tyr Thr He Tyr 
25 35 40 45 

Ala Trp Phe Leu Leu Val Ser Thr Ala Gly He Pro Val Ala Val Ala 
50 55 go 

Lys Gin Val Ala Lys Tyr Asn Thr Met Arg Glu Glu Glu His Ser Phe 
30 65 70 75 bo 

Ala Leu He Arg Ser Phe Leu Gly Phe Met Thr Gly Leu Gly Leu Val 
85 90 95 

35 Phe Ala Leu Val Leu Tyr Val Phe Ala Pro Trp Leu Ala Asp Leu Ser 
100 . 105 no 

Gly Val Gly Lys Asp Leu lie Pro He Met Gin Ser Leu Ala Trp Gly 
40 115 120 125 

Val Leu He Phe Pro Ser Met Ser Val He Arg Gly Phe Phe Gin Glv 
130 135 140 

Met Asn Asn Leu Lys Pro Tyr Ala Met Ser Gin He Ala Glu Gin Val 
45 145 150 155 



160 



He Arg Val He Trp Met Leu Leu Ala Thr Phe He lie Met Lys Leu 
1^5 170 175 

50 Gly Ser Gly Asp Tyr Leu Ala Ala Val Thr Gin Ser Thr Phe Ala Ala 
180 185 190 

Phe Val Gly Met Val Ala Ser Phe Ala Val Leu He Tyr Phe Leu Ala 
55 195 ■ 200 205 

Gin Glu Ser Ser Leu Lys Arg Val Phe Glu Thr Gly Asp Lys He Asn 
210 215 220 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Ser Lys Arg Leu Leu Val Asp Thr He Lys Glu Ala lie Pro Phe He 
225 230 235 240 

Leu Thr Gly Ser Ala lie Gin lie Phe Gin lie Leu Asp Gin Leu Thr 
245 250 255 

Phe He Asn Ser Met Ser Trp Phe Thr Asn Tyr Ser Asn Glu Asp Leu 
260 265 270 

Val Val Met Phe .Ser Tyr Phe Ser Ala Asn Pro Asn Lys He Thr Met 
275 280 285 

He Leu lie Ser Val Gly Val Ser lie Gly Ser Val Gly Leu Pro Leu 
290 295 300 

Leu Thr Glu Asn Tyr Val Lys Gly Asp Leu Lys Ala Ala Ser Arg Leu 
d05 310 315 3 20 

Val Gin Asp Ser Leu Thr Leu Leu Phe Met Phe Leu Leu Pro Ala Thr 
325 33 0 335 

Val Gly Val Val Met Val Gly Glu Pro Leu Tyr Thr Val Phe Tyr Glv 
340 345 350 

Lys Pro Asp Ser Leu Ala Leu Gly Leu Phe Val Phe Ala Val Leu Gin 
355 3 6 o 365 

Ser lie He Leu Gly Leu Tyr Met Val Leu Ser Pro Met Leu Gin Ala 
370 375 380 

Met Phe Arg Asn Arg Lys Ala Val Leu Tyr Phe He Tyr Gly Ser He 
85 390 395 400 

Ala Lys Leu Val Leu Gin Leu Pro Thr He Ala Leu Phe His Ser Tyr 
405 410 415 

Gly Pro Leu lie Ser Thr Thr He Ala Leu He He Pro Asn Val Leu 
420 425 43 0 

Met Tyr Arg Asp lie Cys Lys Val Thr Gly Val Lys Arg Lys Val lie 
435 440 445 

Leu Lys Arg Thr He Leu lie Ser Leu Leu Thr Leu Val Met Phe Leu 
450 455 4 60 

Leu lie Gly Thr He Gin Trp Leu Leu.^Gly Phe Phe Phe Gin Pro Ser 
465 470 475 480 

Gly Arg Leu Trp Ser Phe Phe Tyr Val Ala Leu Val Gly Ala Met Gly 
485 4 90 495 

Gly Gly Leu Tyr Met Val Met Ser Leu Arg Thr Tyr Leu Leu Asp Lys 
500 505 510 

Val lie Gly Lys Ala Gin Ala Asp Arg Leu Arg Ala Lys Phe Lys Leu 
515 520 525 
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Ser 

5 . 

<210> 184 
<211> 155 
<212> PRT . 

<213> Streptococcus pneumoniae 
<400> 184 

Met Ser Asp Lys lie Gly Leu Phe Thr Gly Ser Phe Asp Pro Met Thr 
1 5 io • 15 

15 Asn Gly His Leu Asp He He Glu Arg Ala Ser Arg Leu Phe Asp Lys 
20 25 30 



20 



Leu Tyr Val Gly lie Phe Phe Asn Pro His Lys Gin Gly Phe Leu Pro 
35 40 45 

He Glu Asn Arg Lys Arg Gly Leu Glu Lys Ala Leu Gly His Leu Glu 
50 • ' 55 60 



Asn Val Glu Val Val Ala Ser His Asp Glu Leu Val Val Asp Val Ala 
65 70 75 



80 



Lys Arg Leu Gly Ala Thr Cys Leu Val Arg Gly Leu Arg Asn Ala Ser 
85 go 95 

30 Asp Leu Gin Tyr Glu Ala Ser Phe Asp Tyr Tyr Asn His Gin Leu Ser 
100 105 no 



35 



Ser Asp lie Glu Thr He Tyr Leu His Ser Arg Pro Glu His Leu Tyr • 
115 120 125 

He Ser Ser Ser Gly Val Arg Glu Leu Leu Lys Phe Gly Gin Asp He 
I 30 135 140 



Ala Cys Tyr Val Pro Glu Ser He Trp Arg Lys 
40 145 150 . 155 

<210> 185 
<211> 143 
45 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 185 

Met Thr He Leu Phe Val Val He Ser Ala Ser Phe Leu Tyr Met Val 
. 1 5 10 15 

Ser Leu Ser Met Lys Pro Tyr Gin Thr Ala Lys Ser Glu Gly Glu Lvs 
20 '25 30 

55 Leu Ala Gin Gin Tyr Ala Gly Leu Glu Gin Ala Asp Gin Val Asp Leu 
35 40 - 45 
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Tyr Asn Gly Leu Glu Ser Tyr Tyr Ser Val Leu Gly Arg Asn Lys Gin 
50 55 60 

Gin Glu Ala Leu Ala Val Leu He Gly Lys Asp Asp His Lys He Tyr 
^ 65 '70 75 80 

Val Tyr Gin Leu Asn Gin Gly Val Ser Gin Glu Lys Ala Glu Thr Val 
85 go 95 

10 Ser Lys Glu Lys Gly Ala Gly Glu He Asp Lys He He Phe Gly Ara 
100 105 110 

Tyr Gin Asp Lys Pro lie Trp Glu Val Lys Ser Gly Ser Asp Phe Tvr 
J5 I" 120 125 

Leu Val Asp Phe Glu Thr Gly Ala Leu Val Asn Lys Glu Gly Leu 
130 135 140 

20 <210> 186 
<211> 243 
<212> PRT 

<213> Streptococcus pneumoniae 
25 <400> 186 

Met He Asp He His Ser His lie Val Phe Asp Val Asp Asp Gly Pro 
1 5 io 15 

Lys Ser Arg Glu Glu Ser Lys Ala Leu Leu Thr Glu Ala Tyr Arg Gin 
iU 20 25 30 

Gly Val Arg Thr lie Val Ser Thr Ser His Arg Arg Lys Gly Met Phe 
35 40 45 

35 Glu Thr Pro Glu Glu Lys He Ala Glu Asn Phe Leu Gin Val Arq Glu 
50 55 60 



40 



lie Ala Lys Glu Val Ala Ser Asp. Leu Val He Ala 'Tyr Gly Ala Glu 
65 70 75 80 

He Tyr Tyr Thr Pro Asp Val Leu Asp Lys Leu Glu Asn Asn Arg He 
85 90 95 



Pro Thr Leu Asn Asn Ser Arg Tyr Ala Leu lie Glu Phe Ser Met Asn 
45 100 105 no 

Thr Pro Tyr Arg Asp He His Ser Ala. Leu Asn Lys lie Leu Met Leu 
115 120 125 

50 Gly lie Thr Pro Val lie Ala His lie Glu Arg Tyr Asp Val Leu Glu 
130 135 140 



55 



Asn Asn Glu Lys' Arg Val Arg Glu Leu lie Asp Met Gly Cys Tyr Thr 
145 150 155 i 6 o 

Gin lie Asn Ser Ser His Val Leu Lys Ser Lys Leu Phe Gly Glu Pro 
165 i7o 175 
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Tyr Lys Phe Met Lys Lys Arg Ala Gin Tyr Phe Leu Glu Arg Asp Leu 
!80 185 190 

5 Val His lie He Ala Ser Asp Met His Asn Val Asp Gly Arg Pro Pro 
195 200 205 



10 



15 



His Met Ala Glu Ala Tyr Asp Leu Val Ser Gin Lys Tyr Gly Glu Ala 
210 215 220 



Lys Ala Gin Glu Leu Phe He Asp Asn Pro Arg Lys He Val Met Asd 
00 * 230 .235 2 40 



225 
Gin Leu lie 



<210> 187 

<211> 308 ' 
20 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 187 

Met Ser Thr He Asp Lys Glu Lys Phe Gin Phe Val Lys Arg Asp Asp 

Phe Ala Ser Glu Thr He Asp Ala Pro Ala Tyr Ser Tyr Trp Lys Ser 
20 - 25 30 

30 Val Phe Lys Gin Phe Met Lys Lys Lys Ser Thr Val Val Met Leu Glv 
35 40 45 

He Leu Val Ala He He Leu He Ser Phe He Tyr Pro Met Phe Ser 

35 

Lys Phe Asp Phe Asn Asp Val Ser Lys Val Asn Asp Phe Ser Val Ara 
65 ™ 75 so 

Tyr He Lys Pro Asn Ala Glu His Trp Phe Gly Thr Asp Ser Asn Gly 
40 85 90 95 

Lys Ser Leu Phe Asp Gly Val Trp Phe Gly Ala Arg Asn Ser He Leu - 
100 105 no 

45 He Ser Val He Ala Thr Val He Asn Leu Val He Gly Val Phe Val 
115 120 125 

Gly Gly He Trp Gly He Ser Lys Ser Val Asp Arg Val Met Met Glu 

50 

Val Tyr Asn Val He Ser Asn He Pro Pro Leu Leu He Val He Val 

145 150 155 iso 

Leu Thr Tyr Ser lie Gly Ala Gly Phe Trp Asn Leu He Phe Ala Met 
^ 165 170 175 



Ser Val Thr Thr Trp He Gly lie Ala Phe Met He Arg Val Gin II 



e 
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■ 180 185 4 190 

Leu Arg Tyr Arg Asp Leu Glu Tyr Asn Leu Ala Ser Arg Thr Leu Gly 
195 200 205 

Thr Pro Thr Leu Lys He Val Ala Lys Asn He Met Pro Gin Leu Val 
210 215 220 

Ser Val He Val Thr Thr Met Thr Gin Met Leu Pro Ser Phe He Ser 
10 225 230 235 240 

Tyr Glu Ala Phe Leu Ser Phe Phe Gly Leu Gly Leu Pro He Thr Val 
245 250 255 

15 Pro Ser Leu Gly Arg Leu He Ser Asp Tyr Ser Gin Asn Val Thr Thr 
260 265 270 



20 



Asn Ala Tyr Leu Phe Trp He Pro Leu Thr Thr Leu Val Leu Val Ser 
275 280 285 

Leu Ser Leu Phe Val Val Gly Gin Asn Leu Ala Asp Ala Ser Asp Pro 
290 295 300 



Arg Thr His Arg 
25 305 



<210> 188 
<211> 77 
30 <212> PRT 

- <213> Streptococcus pneumoniae 

<400> 188 

Met Tyr Asn Leu Leu Leu Thr He Leu Leu Val Leu Ser Val Val He 
35 1 5 .10 15 

Val He Ala He Phe Met Gin Pro Thr Lys Asn Gin Ser Ser Asn Val 
20 25 30 

40 Phe Asp Ala Ser Ser Gly Asp Leu Phe Glu Arg Ser Lys Ala Arg Gly 
35 40 45 

Phe Glu Ala Val Met Gin Arg Leu Thr Gly He Leu Val Phe Phe Trp 
^ 50 55 60 

Leu Ala He Ala Leu Ala Leu Thr Val Leu Ser Ser Arg 
65 70 * " . 75 

50 <210> 189 
<211> 369 
<212> PRT 

<213> Streptococcus pneumoniae 
55 <400> 189 

Met Phe Arg Arg Asn Lys Leu Phe Phe Trp Thr Thr Glu He Leu Leu 
15 io 15 
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Leu Thr lie lie Phe Tyr Leu Trp Arg Gin Met Gly Ser Leu lie Asn 
20 25 30 

5 Pro Phe Val Ser Val Leu Asn Thr lie Met lie Pro Phe Leu Leu Gly 
35 40 45 



Gly Phe Leu Tyr Tyr Leu Thr Asn Pro lie Val Thr Phe Leu Asn Lys 
50 55 60 

Val Cys Lys Leu Asn Arg Leu Leu Gly lie Leu He Thr Leu Cys Thr 
65 -70 75 80 

Leu Val Trp Gly Met Val He Gly Val Val Tyr Leu Leu Pro He Leu 
85 90 95 

He Asn Gin Leu Ser Ser Leu He He Ser Ser Gin Thr He Tyr Ser 
100 105 no 

20 Arg Val Gin Asp Leu He He Asp Leu Ser Asn Tyr Pro Ala Leu Gin 
115 120 125 



10 



15 



Asn Leu Asp Val Glu Ala Thr He Gin Gin Leu Asn Leu Ser Tyr Val 
. 130 135 140 

Asp He Leu Gin Asn lie Leu Asn Ser Val Ser Asn Ser Val Gly Ser 
145 150 155 i 6 o 

Val Leu Ser Ala Leu He Ser Thr Val Leu lie Leu He Met Thr Pro 
165 ^ 170 175 

Val Phe Leu Val Tyr Phe Leu Leu Asp Gly His Lys Phe Leu Pro Met 
180 185 190 

35 Leu Glu Arg Thr He Leu Lys Arg Asp Arg Leu His lie Ala Gly Leu 
195 200 205 



25 



30 



40 



45 



Leu Lys Asn Leu Asn Ala Thr He Ala Arg Tyr He Ser Gly Val Ser 
210 215 220 

lie Asp Ala He He lie Gly Cys Leu Ala Tyr lie Gly Tyr Ser He 
225 230 235 240 

He Gly Leu Lys Tyr Ala Leu Val Phe Ala lie Phe Ser Gly Val Ala 
245 250 255 



Asn Leu lie Pro Tyr Val Gly Pro Ser. He Gly Leu lie Pro Met lie 
260 265 270 

50 He Ala Asn lie Phe Thr Val Pro His Arg Leu Leu He Ala Val lie 
275 " 280 285 



55 



Tyr Met Leu Val Val Gin Gin Val Asp Gly Asn He Leu Tyr Pro Arq 
290 295 300 

He Val Gly Ser Val Met Lys Val His Pro lie Thr lie Leu Val Leu 
305 310 315 320 
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Leu Leu Leu Ser Ser Asn He Tyr Gly Val Val Gly Met He Val Ala 
325 330 335 

5 Val Pro Thr Tyr Ser He Leu Lys Glu He Ser Lys Phe Leu Ser Ara 
340 345 350 



10 



Leu Tyr Glu Asn His Lys He Met Lys Glu Arg Glu Arg Glu Leu Ala 
355 - 360 365 

Lys 



15 <210> 190 
<211> 451 
<212> PRT 

<213> Streptococcus pneumoniae 
20 <400> 190 

Met Tyr Gin Ala Leu Tyr Arg Lys Tyr Arg Ser Gin Asn Phe Ser Gin 
15 10 15 

Leu Val Gly Gin Glu Val Val Ala Lys Thr Leu Lys Gin Ala Val Glu 
25 20 25 ' 30 

Gin Glu Lys He Ser His Ala Tyr Leu Phe Ser Gly Pro Arg Gly Thr 
35 40 45 

30 Gly Lys Thr Ser Val Ala Lys He Phe Ala Lys Ala Met Asn Cys Pro' 
50 55 60 



35 



Asn Gin Val Gly Gly Glu Pro Cys Asn Asn Cys Tyr He Cys Gin Ala 
65 ™ 75 80 

Val Thr Asp Gly Ser Leu Glu Asp Val He Glu Met Asp Ala Ala Ser 
85 90 95 



^ Asn Asn Gly Val Asp Glu He Arg Glu He Arg Asp Lys Ser Thr Tyr 
40 100 105 no 

Ala Pro Ser Leu Ala Arg Tyr Lys Val Tyr He He Asp Glu Val His 
115 120 125 

45 Met Leu Ser Thr Gly Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu Glu 
130 135 140 



Pro Thr Gin Asn Val Val Phe He Leu Ala Thr Thr Glu Leu His Lys 

50 



145 150 - 155 160 



He Pro Ala Thr He Leu Ser Arg Val Gin Arg Phe Glu Phe Lys Ser 
165 170 175 

He Lys Thr Gin Asp He Lys Glu His He His Tyr lie Leu Glu Lys 
55 180 185 190 

Glu Asn He Ser Ser Glu Pro Glu Ala Val Glu He lie Ala Arg Arg 
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195 



200 



205 



Ala Glu Gly Gly Met Arg Asp Ala Leu Ser He Leu Asp Gin Ala Leu 
210 215 220 



10 



Ser Leu Thr Gin Gly Asn Glu Leu Thr Thr Ala He Ser Glu Glu He 
225 230 235 240 

Thr Gly Thr lie Ser Leu Ser Ala Leu Asp Asp Tyr Val Ala Ala Leu 
245 250 255 

Ser Gin Gin Asp Val Pro Lys Ala Leu Ser Cys Leu Asn Leu Leu Phe 
2 60 265- 270 

15 Asp Asn Gly Lys Ser Met Thr Arg Phe Val Thr Asp Leu Leu His Tvr 
27 5 280 285 

Leu Arg Asp Leu Leu He Val Gin Thr Gly Gly Glu Asn Thr His His 
290 295 300 

Ser Ser Val Phe Val Glu Asn Leu Ala Leu Pro Gin Lys Asn Leu Phe 
305 310 315 320 



20 



25 



Glu Met He Arg Leu Ala Thr Val Asn Leu Ala Asp He Lys Ser Se 



325 330 335 



er 



Leu Gin Pro Lys He Tyr Ala Glu Met Met Thr Val Arg Leu Ala Glu 
340 345 35 0 

30 He Lys Pro Glu Pro Ala Leu Ser Gly Ala Val Glu Asn Glu He Ala 
355 360 365 



35 



Thr Leu Arg Gin Glu Val Ala Arg Leu Lys Gin Glu Leu Ser Asn Ala 
370 375 380 

Gly Ala Val Pro Lys Gin Val Ala Pro Ala- Pro Ser Arg Pro Ala Thr 
385 390 . 395 4 00 



Gly Lys Thr Val Tyr Arg Val Asp Arg Asn Lys Val Gin Ser He Leu 
40 . 405 410 415 

Gin Glu Ala Val Glu Asn Pro Asp Leu Thr Arg Gin Asn Leu He Ara 



420 



.425 



430 



45 Leu Gin Asn Ala Trp Gly Glu Val He Glu Ser Leu Gly Gly Pro 



435 



440 



445 



Asp 



50 



Lys Leu Cys 
450 



<210> 191 
<211> 662 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 191 
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Met Phe Arg Leu Thr Asn Lys Leu Ala Val Ser Asn Leu lie Lys Asn 
15 10 15 

Arg Lys Leu Tyr Tyr Pro Phe Ala Leu Ala Val Leu Leu Ala Val Thr 
5 20 25 3 0 

Leu Thr Tyr Leu Phe Tyr Ser Leu Thr Phe Asn Pro Lys lie Ala Glu 
35 40 45 

10 He Arg Gly Gly Thr Thr He Gin Ala Thr Leu Gly Phe Glv Met Phe 
50 55 60 



Val Val Thr Leu Ala Ser Ala He He Val Leu Tyr Ala Asn Ser Phe 
, 5 65 



Val Met Lys Lys Arg Ser Lys Glu Leu Gly He Tyr Gly Met Leu Gly 
85 90 95 

Leu Glu Lys Arg His Leu He Ser Met Thr Phe Lys Glu Leu Val Val 
ZU 100 105 no 

Phe Gly He Leu Thr Val Gly Ala Gly lie Gly He Gly Ala Leu Phe 
115 120 125 

25 Asp Lys Leu He Phe Ala Phe Leu Leu Lys Leu Met Lys Leu Lys Val 
13 ° 135 140 



Glu Leu Val Ala Thr Phe Gin Thr Lys Val Val He Thr Val Leu Val 

30 



145 150 155 i 60 



Val Phe Gly Leu He Phe Leu Gly Leu Met Phe Leu Asn Ala Leu Arg 
165 170 175 

lie Ala Arg Met Asn Ala Leu Gin Leu Ser Arg Glu Lys Ala Ser Glv 
55 ■ 180 185 igo 

Glu Lys Lys Gly Arg Phe Leu Pro Leu Gin Thr He Leu Gly Ser He 
195 200 205 

40 Ser Leu Gly He Gly Tyr Tyr Leu Ala Leu Thr Val Lys Asp Pro Leu 
210 215 220 

Thr Ala Leu Thr Thr Phe Phe He Ala Val Leu Leu Val He Phe Glv 
45 225 230 235 240 

Thr Tyr Leu Leu Phe Asn Ala Gly He Thr Val Phe Leu Gin lie Leu 
245 250 255 

Lys Lys Asn Lys Lys Tyr Tyr Tyr Gin Pro Asn Asn Leu lie Ser Val 
W . 260 265 270 

Ser Asn Leu He Phe Arg Met Lys Lys Asn Ala Val Gly Leu Ala Thr 
275 280 285 

55* He Ala He Leu Ser Thr Met Val Leu Val Thr Met Ser Ala Ala Thr 
290 295 300 



134 



W ° 01/49721 PCT/US00/35604 



15 



20 



30 



35 



Ser lie Phe Asn Ser Ala Glu Ser Phe Lys Lys Val Leu Asn Pro His 
305 310 . 315 320 

Asp Phe Gly Val Ser Gly Gin Asn Val Glu Lys Glu Asp Leu Asp Lys 
D 325 330 335 

Leu Leu Ser Gin Phe Ala Ser Asp Asn Gly Tyr Lys Il e L ys Glu Lys 
340 345 350 

10 Glu Val Phe Arg Tyr Thr Tyr Phe Gly Val Ala Asn Gin Glu Gly Asn 
355 360 365 



Lys Leu Thr Phe Phe Glu Lys Gly Gin Asn Arg Val Gin Pro Thr Thr 
370 375 3 8 o 

Val Phe Met Val Phe Asp Gin Lys Asp Tyr Glu Asn Met Thr Gly Gin 
385 390 395 400 

Lys Leu Ser Leu Ser Gly Asn Glu Val Gly Leu Phe Ala Lys Asn Asp 
405 410 415 • 

Gly Leu Lys Gly Gin Lys Thr Leu Ile Leu Asn Asp His Gin Phe Ser 
420 425 4 3o 

25 Val Lys Glu Glu Phe Asn Lys Asp Phe Ile Val Asn His Val Pro Asn 
435 440 445 

Gin Phe Asn Ile Leu Thr Ala Asp Tyr Asn Tyr Leu Val Val Pro Asd 
450 455 460 

Leu Gin Ala Phe Leu Asn Gin Phe Pro Asp Ser Asp Ile Tyr Asn Gin 
465 4 ™ 475 480 

Phe Tyr Gly Gly Met Asn Val Asn Val Ser Glu Glu Glu Gin Leu Lys 
485 490 495 

Val Ala Glu Glu Tyr Glu Asn Tyr Leu Asn Gin Phe Asn Ala Gin Leu 
500 505 510 

40 Asp Thr Glu Gly Ser Tyr Val Tyr Gly Ser Asn Leu Ala Asp Ala Ser 
515 520 525 

Ser Gin Met Ser Ala Leu Phe Gly Gly Val Phe Phe lie Gly Ile Phe 
45 530 535 540 

Leu Ser Ile Ile Phe Met Val Gly Thr Val Leu Val Ile Tyr Tyr Lvs 

545 550 .... 555 560 

Gin lie Ser Glu Gly Tyr Glu Asp Arg Glu Arg Phe lie lie Leu Gin 
3U 565 570 575 

Lys Val Gly Leu Asp Gin Lys Gin Ile Lys Gin Thr Ile His Lys Gin 
580 585 590 

55 Val Leu Thr Val Phe Phe Leu Pro Leu Leu Phe Ala Phe Ile His Leu 
595 6oo 605 
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Ala Phe Ala Tyr His Met Leu Ser Leu He Leu Lys Val He Glv Val 

610 615 620 

leu Asp Thr Thr Met Met Leu lie Val Thr Leu Ser He Cys Ala He 
0 625 630 635 



64 0 



Phe Leu He Ala Tyr Val Leu He Phe Met lie Thr Ser Arg Ser Tvr 
645 650 655 

10 Arg Lys He Val Gin Met 
660 



<210> 192 
15 <211> 296 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 192 

20 Met Lys Gin Asp Gin Leu Lys Ala Trp Gin Pro Ala Gin Phe Asp Arg 
. 5 10 15 

Phe Val Arg lie Leu Glu Gin Asp Gin Leu Asn His Ala Tyr Leu Phe 

25 



20 25 30 



Ser Gly Phe Phe Gly Ser Leu Glu Met Ala Gin Phe Leu Ala Lys Ser 
35 40 45 

. Leu Phe Cys Thr Asp Lys Val Gly Val Leu Pro Cys Glu Lys Cys Arg 

50 55 gQ 

Ser Cys Lys Leu He Glu Gin Glu Glu Phe Pro Asp Val Thr Leu He 
65 70 75 go 

35 Lys Pro Val Asn Gin Val lie Lys Thr Glu Arg He Arg Glu Leu Val 

85 go 95 

Gly Gin Phe Ser Gin Ala Gly lie Glu Ser Gin Gin Gin Val Phe IU 

40 



ioo 105 110 



He Glu Gin Ala Asp Lys Met His Pro Asn Ala Ala Asn Ser Leu Leu 
115 120 125 

Lys Val He Glu Glu Pro Gin Ser Glu Val Tyr He Phe Phe Leu Thr 
4:> 130 135 140 

Ser Asp Glu Glu Lys Met Leu Pro Thr He Arg Ser Arg Thr Gin He 
145 150 •- 155 160 

50 Phe His Phe Lys Lys Gin Glu Glu' Lys Leu He Leu Leu Leu Glu Glh 
' 165 170 175 

Met Gly Leu Val Lys Lys Lys Ala Thr Leu Leu Ala Lys Phe Ser Gin 
55 180 "5 190 

Ser Arg Ala Glu Ala Glu Lys Leu Ala Asn Gin Ala Ser Phe Trp Thr 
195 200 205 
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Leu Val Asp Glu Ser Glu Arg Leu Leu Thr Trp Leu Val Ala Lys Lvs 
210 215 220 

5 Lys Glu Ser Tyr Leu Gin Val Ala Lys Leu Ala Asn Leu Ala Asp Asp 
225 230 235 240 



10 



Lys Glu Lys Gin Asp Gin Val Leu Arg He Leu Glu Val Leu Cvs Glv 
245 250 255 

Gin Asp Leu Leu Gin Val Arg Val Arg Val He Leu Gin Asp Leu Leu 
260 . 265 270 



Glu Ala Arg Lys Met Trp Gin Ala Asn Val Ser Phe Gin Asn Ala Met 
1!> 275 280 285 



20 



25 



Glu Tyr Leu Val Leu Lys Glu He 
290 295 



<210> 193 
<211> 204 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 193 

• Met Asn Ser Phe Lys Asn Phe Leu Lys Glu Trp Gly Leu Phe Leu Leu 
1 . . 5 10 15 

30 He Leu Ser Leu Leu Ala Leu Ser Arg lie Phe Phe Trp Ser Asn Val 
20 25 30 

Arg Val Glu Gly His Ser Met Asp Pro Thr Leu Ala Asp Gly Glu He 
35 -35 40 45 

Leu Phe Val Val Lys His Leu Pro He Asp Arg Phe Asp lie Val Val 
50 55 60 

Ala His Glu Glu Asp Gly Asn Lys Asp He Val Lys Arg Val lie Glv 
40 65 70" 75- so 

Met Pro Gly Asp Thr He Arg Tyr Glu Asn Asp Lys Leu Tyr He Asn 
85 90 95 

45 Asp Lys Glu Thr Asp Glu Pro Tyr Leu Ala Asp Tyr He Lys Arg Phe 
100 105 no 

Lys Asp Asp Lys Leu Gin Ser Thr Tyr Ser Gly Lys Gly Phe Glu Gly 

50 

Asn Lys Gly Thr Phe Phe Arg Ser He Ala Gin Lys Ala Gin Ala Phe 
130 135 140 

Thr Val Asp Val Asn Tyr Asn Thr Asn Phe Ser Phe Thr Val Pro Glu 
55 145 150 155 leo 

Gly Glu Tyr Leu Leu Leu Gly Asp Asp Arg Leu Val Ser Ser Asp Ser 
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165 170 



175 



Arg His Val Gly Thr Phe Lys Ala Lys Asp He Thr Gly Glu Ala Lys 

5 180 185 190 ' 

Phe Arg Phe Trp Pro He Thr Arg He Gly Thr Phe 
195 200 



10 <210> 194 
<211> 328 
<212> PRT 

<213> Streptococcus pneumoniae 
15 <400> 194 

Met Val Val Phe Thr Gly Ser Thr Val Glu Glu Ala He Gin Lys Glv 
1 5 - 10 Is ' 

Leu Lys Glu Leu Asp He Pro Arg Met Lys Ala His He Lys Val lie 
20 20 25. 30 

Ser Arg Glu Lys Lys Gly Phe Leu Gly Leu Phe Gly Lys Lys Pro Ala 
35 40 45 

25 Gin Val Asp lie Glu Ala He Ser Glu Thr Thr Val Val Lys Ala Asn 
50 55 60 



30 



45 



Gin Gin Val Val Lys Gly Val Pro Lys Lys lie Asn Asp Leu Asn Glu 
65 70 75 80 

Pro Val Lys Thr Val Ser Glu Glu Thr Val Asp Leu Gly His Val Val 
85 90 95 



Asn Ala He Lys Lys He Glu Glu Glu Gly Gin Gly He Ser Asp Glu 
35 100 105 no 

Val Lys Ala Glu He Leu Lys His Glu Arg His Ala Ser Thr lie Leu 
115 120 125 

40 Glu Glu Thr Gly His lie Glu He Leu Asn Glu Leu Gin lie Glu Glu 
130 135 140 



Ala Met Arg Glu Glu Ala Gly Ala Asp Asp Leu Glu Thr Glu Gin Asp 
14 5 150 155 160 

Gin Thr Glu Asn Gin Asp Leu Lys Glu Met Gly Leu Lys Val Glu Gin 
165 .170 175 



Ser Tyr Asp lie Ala Gin Val Ala Thr Asp Val Thr -Ala Tyr Val Gin 
50 180 185 190 

Ala He Val Asp Asp Met Asp Val Glu Ala Thr Leu Ser Asn Asp Tyr 
195 200 205 

55 Asn Arg Arg Ser lie Asn Leu Gin He Asp Thr Asn Glu Pro Gly Arq 
210 215 220 
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Ue lie Gly Tyr His Gly Lys Val Leu Lys Ala Leu Gin Leu Leu Ala 
5 230 235 240 

Gin Asn Tyr Leu Tyr Asn Arg Tyr Ser Lys Thr Phe Tyr Val Thr lie 
245 250 255 . 

Asn Val Asn Asp Tyr Val Glu His Arg Ala Glu Val Leu Gin Thr Tyr 
260 265 270 

10 Ala Gin Lys Leu Ala Asn Arg Val Leu Glu Glu Gly Arg Ser His Lys 
^'3 280 



285 



15 



20 



Thr Asp Pro Met Ser Asn Ser Glu Arg Lys lie lie His Arg lie He 

Ser Arg Met Asp Gly Val Thr Ser Tyr Ser Glu Gly Asp Glu Pro Asn 
5 310 315 32Q 

Arg Tyr Val Val Val Asp Thr Glu 
325 



<210> 195 
<211> 460 
25 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 195 

Met Ser Asn Phe Ala He lie Leu Ala Ala Gly Lys Gly Thr Arg Met 
1 5 10 15 

Lys Ser Asp Leu Pro Lys Val Leu His Lys Val Ala Gly He Ser Met 
20 25 30 

35 Leu Glu His Val Phe Arg Ser Val Gly Ala lie Gin Pro Glu Lys Thr 
J5 40 45 

Val Thr Val Val Gly His Lys Ala Glu Leu Val Glu Glu Val Leu Ala 

40 55 60 

Gly Gin Thr Glu Phe Val Thr Gin Ser. Glu Gin Leu Gly Thr Gly His 

65 70 -75 Y 80 

Ala Val Met Met Thr Glu Pro lie Leu Glu Gly Val Ser Gly His Thr 
85 90 95 

Leu Val He Ala Gly Asp Thr Pro Leu -He Thr Gly Glu Ser Leu Lvs 
100 105 110 

50 Asn Leu He Asp Phe His He Asn His Lys Asn Val Ala Thr He Leu 
115 120 125 



55 



Thr Ala Glu Thr Asp Asn Pro Phe Gly Tyr Gly Arg He Val Arg Asn 

135 140 



Asp Asn Ala Glu Val Leu Arg Ser Leu Leu Ser Arg Arg Met Leu Gin 
■ 150 155 • i 60 
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. 5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



He Leu Lys Ser Lys Ser Arg Lys Ser Thr Leu Val Thr Tyr Val Phe 
165 170 175 

Asp Asn Glu Arg Leu Phe Glu Ala Leu Lys Asa lie Asn Thr Asn Asn 
180 185 190 

Ala Gin Gly Glu Tyr Tyr lie Thr Asp Val He Gly He Phe Arg Glu 
i95 200 205 

Thr Gly Glu Lys Val Gly Ala Tyr Thr Leu Lys Asp Phe Asp Glu Ser 
210 215 220 

Leu Gly Val Asn Asp Arg Val Ala Leu Ala Thr Ala Glu Ser Val Met 
5 230 235 240 

Arg Arg Arg He Asn His Lys His Met Val Asn Gly Val Ser Phe Val 
245 .250 255 

Asn Pro Glu Ala Thr Tyr lie Asp lie Asp Val Glu He Ala Pro Glu 
260 265 270 

Val Gin lie Glu Ala Asn Val lie Leu Lys Gly Gin Thr Lys He Gly 
275 280 285 

Ala Glu Thr Val Leu Thr Asn Gly Thr Tyr Val Val Asp Ser Thr He 
290 295 300 

Gly Ala Gly Ala Val He Thr Asn Ser Met He Glu Glu Ser Ser Val 
J ° 5 310 315 3 20 

Ala Asp Gly Val Thr Val Gly Pro Tyr Ala His lie Arg Pro Asn Ser 
325 33 0 ■ 335 

Ser Leu Gly Ala Gin Val His He Gly Asn Phe Val Glu Val Lvs Glv 
340 345 350 

Ser Ser lie Gly Glu Asn Thr Lys Ala Gly His Leu Thr Tyr He Gly 
355 360 365 

Asn Cys Glu Val Gly Ser Asn Val Asn Phe Gly Ala Gly Thr He Thr 
J/u 375 380 

Val Asn Tyr Asp Gly Lys Asn Lys Tyr Lys Thr Val He Gly Val Asn 
J85 390 395 400 

Val Phe Val Gly Ser Asn Ser Thr lie. He Ala Pro Val Glu Leu Glv 
405 410 415 

Asp Asn Ser Leu Val Gly Ala Gly Ser Thr He Thr Lys Asp Val Pro 
420 425 43 0 

Ala Asp Ala He Ala He Gly Arg Gly Arg Gin He Asn Lys Asp Glu 
4J -> 440 445 

Tyr Ala Thr Arg Leu Pro His His Pro Lys Asn Gin 
450 455 46Q 
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<210> 196 
<211> 311 
5 <212> PRT 

<21'3> Streptococcus pneumoniae 

<400> 196 



Met Ser Lys lie Leu Val Phe Gly His Gin Asn Pro Asp Ser Asp Ala 

• 5 10 15 

He Gly Ser .Ser Val Ala Phe Ala Tyr Leu Ala Lys Glu Ala Tyr Gly 
20 25 30 

15 Leu Asp Thr Glu Ala Val Ala Leu Gly Thr Pro Asn Glu Glu Thr Ala 

■33 4Q 



45 



Phe Val Leu Asn Tyr Phe Gly Val Glu Ala Pro Arg Val lie Thr Ser 
20 . 55 60 

Ala Lys Ala Glu Gly Ala Glu Gin Val lie Leu Thr Asp „ is Asn ^ 

70 75 80 

^ Phe Gin Gin Ser Val Ser Asp He Ala Glu Val Glu Val Tyr Gly Val 



90 



95 



Val Asp His His Arg Val Ala Asn Phe -Glu Thr Ala Ser Pro Leu Tyr 
100 1Q 5 110 

30 Met Arg Leu Glu Pro Val Gly Ser Ala Ser Ser lie Val Tyr Arg Met 

Phe Lys Glu His Gly Val Ala Val Pro Lys Glu He Ala Gly Leu Met 
35 140 

Leu Ser Gly Leu lie Ser Asp Thr Leu Leu Leu Lys Ser Pro Thr Thr 

150 155 



160 

His Pro Thr Asp Lys lie lie Ala Pro Glu Leu Ala Glu Leu Ala Gly 
165 170 * 



175 



Val Asn Leu Glu Glu Tyr Gly Leu Ala Met Leu Lys Ala Gly Thr Asn 

185 190 



45 Leu Al 



a Ser Lys Ser Ala Glu Glu Leu lie Asp. He Asp Ala Lys Thr 
x 200 



205 



Phe Glu Leu Asn Gly Asn Asn Val Arg Val Ala Gin Val Asn Thr Val 
50 215 220 

Asp lie Ala Glu Val Leu Glu Arg Gin Ala Glu He Glu Ala Ala Met 
230 235 240 

55 Gin Ala Ala Asn Glu Ser Asn Gly Tyr Ser Asp Phe Val Leu Met lie 

245 250 255 

Thr Asp lie Val Asn Ser Asn Ser Glu lie Leu Ala Leu Gly Ala Asn 
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260 



265 



270 



Met Asp Lys Val Glu Ala Ala Phe Asn Phe L ys Leu Glu Asn Asn His 

280 285 
Ala Phe Leu Ala Gly Ala v.J ser Arg Lys Lys ^ Val Val prQ Gln 



300 



Leu Thr Glu Ser Phe Asn Thr 
10 305 310 



<210>*197 
<211> 225 
15 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 197 

2Q Met He Ser Lys Arg Leu Glu Leu Val Ala Ser Phe Val Ser Gin Gly 

10 15 



Ala He Leu Leu Asp Val Gly Ser Asp His Ala T y r Leu Pro He Glu 

25 ■ 30 
25 Leu Val Glu Arg Gly Gin lie Lys Ser Ala lie Ala Gly Glu Val Val 



45 



Glu Gly Pro T y r Gin Ser Ala Val Lys Asn Val Glu Ala His Gly Leu 
30 60 

LJJ Glu Lys lie Gin Val Arg Leu Ala Asn Gly Leu Ala Ala Phe Glu 



• 75 



80 



35 



Glu Thr Asp Gin Val Ser Val lie Thr lie Ala Gly Met Gly Gly Arg 

90 95 

Leu He Ala Arg He Leu Glu Glu Gly Leu Gly Lys Leu Ala Asn Val 

105 11Q 

40 Glu Arg Leu He Leu Gin Pro Asn Asn Arg Glu Asp Asp Leu Arg Ile 



125 



Trp Leu Gin Asp His Gly Phe Gin . lie Val Ala Glu Ser lie Leu Glu 

45 135 140 

Glu Ala Gly Lys Phe Tyr Glu He Leu Val Val Glu Ala Gly Gin Met 

155 160 
s Lys Leu Ser Ala Ser Asp Val Arg Phe" Gly Pro Phe Leu Ser Lys Glu 

165 170 175 . 



Val Ser Pro Val Phe Val' Gin Lys Trp Gin Lys Glu Ala Glu Lys Leu 

185 190 

55 Glu Phe Ala Leu Glv Gin Hp n„ T 

195 y b - Ln IIe P Jo Glu Lys Asn Leu Glu Glu Arg Gin 



205 
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Val Leu Val Asp Lys He Gin Ala He Lys Glu Val Leu His Val Ser 
210 215 220 



Lys 
5 225 



<210> 198 
<211> 161 
10 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 198 

Met Asn Leu Asn Asp He Lys Asp Leu Met Thr Gin Phe Asp Gin Ser 
15 1 5 io .15 

Ser Leu Arg Glu Phe Ser Tyr Lys Asn Gly Thr Asp Glu Leu Gin Phe 
20 25 30 

20 Ser Lys Asn Glu Ala Arg Pro Val Pro Glu Val Ala Thr Gin Val Ala 
35 40 45 

Pro Ala Pro Val Leu Ala Thr Pro Ser Pro Val Ala Pro Thr Ser Ala 

25 

Pro Ala Glu Thr Val Ala Glu Glu Val Pro Ala Pro Ala Glu Ala Ser 
65 - 70 75 80 

Val Ala Ser Glu Gly Asn Leu Val Glu Ser Pro Leu Val Gly Val Val 
30 85 90 95 

Tyr Leu Ala Ala Gly Pro Asp Lys Pro Ala Phe Val Thr Val Gly Asp 

ioo 105 no 

35 Ser Val Lys Lys Gly Gin Thr Leu Val He lie Glu Ala Met Lys Val 
H5 120 125 

Met Asn Glu lie Pro Ala Pro Lys Asp Gly Val Val Thr Glu lie Leu 
An 130 135 140 

40 



Val Ser Asn Glu Glu Met Val Glu Phe Gly Lys Gly Leu Val Arq lie 
145 150 hr- 

45 



150 155" - 160 

Lys 



<210> 199 
<211> 411 
50 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 199 

Met Lys Leu Asn Arg Val Val Val Thr Gly Tyr Gly Val Thr Ser Pro 
1 5 io 15 

He Gly Asn Thr Pro Glu Glu Phe Trp Asn Ser Leu Ala Thr Gly Lys 
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20 



25 



30 



He Gly He Gly Gly He Thr Lys Phe Asp His Ser Asp Phe Asp Val 
35 40 45 

His Asn Ala Ala Glu He Gin Asp Phe Pro Phe Asp Lys Tyr Phe Val 
50 ' 55 60 



10 65 



Lys Lys Asp Thr Asn Arg Phe Asp Asn Tyr Ser Leu Tyr Ala Leu Tyr 



70 



75 



80 



Ala Ala Gin Glu Ala Val Asn His Ala Asn Leu Asp Val Glu Ala Leu 
85 90 95 

15 Asn Arg Asp Arg Phe Gly Val He Val Ala Ser Gly He Gly Gly He 



100 



105 



110 



20 



Lys Glu He Glu Asp Gin Val Leu Arg Leu His Glu Lys Gly Pro Lys 
115 120 125 

Arg Val Lys Pro Met Thr Leu Pro Lys Ala Leu Pro Asn Met Ala Ser 
130 135 • 140 



Gly Asn Val Ala Met Arg Phe Gly Ala Asn Gly Val Cys Lys Ser He 
145 150 155 



160 



Asn Thr Ala Cys Ser Ser Ser Asn Asp Ala He Gly Asp Ala Phe Ara 
165 



170 



175 



30 Ser He Lys Phe Gly Phe Gin Asp Val Met Leu Val Gly Gly Thr Glu 
180 185 190 



35 



Ala Ser He Thr Pro Phe Ala He Ala Gly Phe Gin Ala Leu Thr Ala 

195 200 205 

Leu Ser Thr Thr Glu Asp Pro Thr Arg Ala Ser He Pro Phe Asp Lvs 
210 215 220 



Asp Arg Asn Gly Phe Val Met Gly Glu Gly Ser Gly Met Leu Val Leu 
4U 225 230 235 240 

Glu Ser Leu Glu His Ala Glu Lys Arg Gly Ala Thr He Leu Ala Glu 
245 250 255 

45 Val Val Gly Tyr Gly Asn Thr Cys Asp Ala Tyr His Met Thr Ser Pro 
260 265 270 



50 



His Pro Glu Gly Gin Gly Ala He Lys- Ala He Lys Leu Ala Leu Glu 
275 280 285 

Glu Ala Glu He Ser Pro Glu Gin Val Ala Tyr Val Asn Ala His Glv 
290 295 300 



Thr Ser Thr Pro Ala Asn Glu Lys Gly Glu Ser Gly Ala He Val Ala 
^ 305 310 315 320 



Val Leu Gly Lys Glu Val Pro Val Ser Ser Thr Lys Ser Phe Thr 



Gly 
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325 330 



335 



His Leu Leu Gly Ala Ala Gly Ala Val Glu Ala He Val Thr He Glu 
5 340 345 350 

Ala Met Arg His Asn Phe Val Pro Met Thr Ala Gly Thr Ser Glu Val 
355 360 • 365 

Ser Asp Tyr He Glu Ala Asn Val Val Tyr Gly Gin Gly Leu Glu Lys 
1U . 370 375 3 8 o * 

Glu He Pro Tyr Ala He Ser Asn Thr Phe Gly Phe Gly Gly His Asn 
385 390 395 4 oO 

15 Ala Val Leu Ala Phe Lys Arg Trp Glu Asn Arg 

405 410 



<210> 200 
20 <211> 359 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 200 

25 Met Asn He Tyr Asp Gin Leu Gin Val Val Glu Asp Arg Tyr Glu Glu 
1 5 10 15 

Leu Gly Glu Leu Leu Ser Asp Pro Asp Val Val Ser Asp Thr Lys Ara 
30 20 25 30 

Phe Met Glu Leu Ser Lys Glu Glu Ala Ser Asn Arg Asp Thr Val lie 
35 40 45 

Ala Tyr Arg Glu Tyr Lys Gin Val Leu Gin Asn He Val Asp Ala Glu 
i!) 50 55 - 60 

Glu Met He Lys Glu Ser Gly Gly Asp Ala Asp Leu Glu Glu Leu Ala 
65 70 75 80 

40 Lys Gin Glu Leu Lys Asp Ala Lys Ala Glu Lys Glu Glu Tyr Glu Glu 

85 90 95 

Lys Leu Lys He Leu Leu Leu Pro Lys Asp Pro Asn Asp Asp Lys Asn 
45 100 105 no 

He lie Leu Glu He Arg Gly Ala Ala Gly Gly Asp Glu Ala Ala Leu 
115 120 125 

Phe Ala Gly Asp Leu Leu Thr Met Tyr Gin Lys Tyr Ala Glu Ala Gin 
5U 130 135 140 

Gly Trp Arg Phe Glu Val Met Glu Ala Ser Met Asn Gly Val Gly Glv 
145 . 150 155 160 

55 Phe Lys Glu Val Val Ala Met Val Ser Gly Gin Ser Val Tyr Ser Lys 

165 170 i7 5 
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Leu Lys Tyr Glu Ser Gly Ala His Arg Val Gin Arg Val Pro Val Thr 
180 185 190 

Glu Ser Gin Gly Arg Val His Thr Ser Thr Ala Thr Val Leu Val Met 
3 195 200 205 

Pro Glu Val Glu Glu Val Glu Tyr Asp lie Asp Pro Lys Asp Leu Arg 
210 215 220 

10 Val Asp He Tyr His Ala Ser Gly Ala Gly Gly Gin Asn Val Asn Lys 
225 230 235 240 

• Val Ala Thr Ala Val Arg He Val His Leu Pro Thr Asn He Lys Val 

15 



245 250 255 



Glu Met Gin Glu Glu Arg Thr Gin Gin Lys Asn Arg Glu Lys Ala Met 
260 265 270 

Lys He He Arg Ala Arg Val Ala Asp His Phe Ala Gin He Ala Gin 
20 275 280 285 

Asp Glu Gin Asp Ala Glu Arg Lys Ser Thr He Gly Thr Gly Asp Ara 
290 295. 300 • 

25 ser Glu Arg He Arg Thr Tyr Asn Phe Pro Gin Asn Arg Val Thr Asd 
• 305 310 315 320 

His Arg He Gly Leu Thr Leu Gin Lys Leu Asp Thr He Leu Ser Gly 
30 . 325 330 335 

Lys Leu Asp Glu Val Val Asp Ala Leu Val Leu Tyr Asp Gin Thr Gin 
- 340 -345 3 50 

Lys Leu Glu Glu Leu Asn Lys 
35 355 



<210> 201 
<211> 559 
40 • <212>. PRT 

<213> .Streptococcus pneumoniae 

<400> 201 

Met Ala Tyr Thr Leu Lys Pro Glu Glu Val Gly Val Phe Ala He Gly 
45 1 5 io 15 

Gly Leu Gly Glu He Gly Lys Asn Thr Tyr Gly He Glu Tyr Gin Asd 
2° 25. 30 

50 Glu He He He Val Asp Ala Gly He Lys Phe Pro Glu Asp Asp Leu 
35 40 45 

Leu Gly He Asp Tyr Val He Pro Asp Tyr Ser Tyr He Val Asp Asn 
55 ■ 50 55 60 

lie Asp Arg Val Lys Ala Val Leu He Thr His Gly His Glu Asp His 
65 70 75 80 



146 



WO 01/49721 



PCT/US00/35604 



He Gly Gly He Pro Phe Leu Leu Lys Gin Ala Asn Val Pro He Tvr 
85 ~~ * 



90 



95 



5 Ala Gly Pro Leu Ala Leu Ala Leu He Arg Gly Lys Leu Glu Glu His 
100 105 i 10 

Gly Leu Leu Arg Asn Ala Lys Leu Tyr Glu He Asn His Asn Thr Glu 
115 120 • 125 

Leu Thr Phe Lys Asn Leu Lys Ala Thr Phe Phe Arg Thr Thr His Ser 
130 135 140 

He Pro Glu Pro Leu Gly He Val He His Thr Pro Gin Gly Lvs He 
15 145 150 is.S - y 



10 



160 



Val Cys Thr Gly Asp Phe Lys Phe. Asp Phe Thr Pro Val Gly Glu Pro 
165 170 



20 Ala Asp Leu His Arg Met Ala Ala Leu Gly Glu Glu Gly Val 



180 



185 



190 



175 



Leu Cys 



25 



Leu Leu Ser Asp Ser Thr Asn Ala Glu Val Pro Thr Phe Thr Asn Ser 
195 200 205 

Glu Lys Val Val Gly Gin Ser lie Met Lys He He Gin Gly He Glu 

210 215 



220 



in ooc ^ IlG Ile Phe Ala Ser Phe Ala Ser Asn Ile p *e Arg Leu Gin 
JU 225 230 235 240 

Gin Ala Thr Glu Ala Ala Val Lys Thr Gly Arg Lys Ile Ala Val Phe 
245 250 



35 Gly Arg Se 



255 

r Met Glu Lys Ala Ile Val Asn Gly Ile Asp Leu Gly Tvr 
260 265 270 



40 



He Lys Ala Pro Lys Gly Thr Phe Ile Glu Pro Asn Glu Ile Lys Asd 
275 280 285 

Tyr Pro Ala Gly Glu Val Leu Ile Leu Cys Thr Gly Ser Gin Gly Glu 
290 295 300 



Pro Met Ala Ala Leu Ser Arg Ile Ala Asn Gly Thr His Arg Gin Val 
45 305 310 315 3 2 o 

Gin Leu Gin Pro Gly Asp Thr Val Ile Phe Ser Ser Ser Pro lie Pro 



325 



.330 



335 



50 Gly Asn Thr Thr Ser Val Asn Lys Leu He Asn Ile Ile Ser Glu Ala 
340 345 350 



Gly Val Glu Val He Hi-s Gly Lys Val Asn Asn Ile His Thr Ser 



55 



355 



360 



365 



Gly 



His Gly Gly Gin Gin Glu Gin Lys Leu Met Leu Cys Leu Ile Lys Pro 
370 375 



380 
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hys Tyr Phe Met Pro Val His Gly Glu Tyr Arg Met Gin Lys Val His 

400 



385 390 395 



5 Ala Gly Leu Ala Val Asp Thr Gly Val Glu Lys Asp Asn lie Phe II 



405 410 " * "~" Jig "° 



Met Ser Asn Gly Asp Val Leu Ala Leu Thr Ala Asp Ser Ala Arg lie 

10 



420 425 43 0 



Ala Gly His Phe Asn Ala Gin Asp He Tyr Val Asp Gly Asn Ara He 
435 440 445 

IS 116 Gly Ma Ala Val LeU L Y S As P **9 Asp Leu Ser Glu 

° 450 455 460 

Asp Gly Val Val Leu Ala Val Ala Thr Val Asp Phe Lys Ser Gin Met 
465 470 475 480 

20 He Leu Ser Gly Pro Asp lie Leu Ser Arg Gly Phe Val Tyr Met Arg- 

485 490 495 

Glu Ser Gly Asp Leu He Arg Gin Ser Gin Arg He Leu Phe Asn Ala 
25 500 505 ' 510 

lie Arg He Ala Leu Lys Asn Lys Asp Ala Ser Val Gin Ser Val Asn 
515 520 525 

™ 116 Val ASn Ala Ile Arg pro Phe Leu. Tyr Glu Asn Thr Glu 

JU 530 535 540 

Arg Glu Pro Ile Ile Ile Pro Met Ile Leu Thr Pro Asp Glu Glu 
545 550 555 

<210> 202 
<211> 450 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 202 

Met Ala Glu Val Glu Glu Leu Arg Val Gin Pro Gin Asp He Leu Ala 
1 5 io 15 

45 Glu Gin Ser Val Leu Gly Ala He Phe He Asp Glu Ser Lys Leu Val 
20 25 30 

Phe Val Arg Glu Tyr He Glu Ser Arg. Asp Phe Phe Lys Tyr Ala His 
35 40 45 

Arg Leu He Phe Gin Ala Met Val Asp Leu Ser Asp Arg Gly Aso Ala 
50 55 60 



35 



40 



50 



lie Asp Ala Thr Thr Val Arg Thr Ile Leu Asp Asn Gin Gly Asp Leu 
65 . 70 75 80 

Gin Asn He Gly Gly Leu Ser Tyr Leu Val Glu Ile Val Asn Ser Val 
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85 90 



95 



Pro Thr Ser Ala Asn Ala Glu Tyr Tyr Ala Lys He Val Ala Glu Lys 
5 100 105 110 

Ala Met Leu Arg Arg Leu lie Ala Lys Leu Thr Glu Ser Val Asn Gin 
115 120 125 

in Ma G1U Ala Ser Gln Pro Ala As P Glu Ile Ile Ala Gin Ala Glu 
1U 130 135 14Q 

Lys Gly Leu Ile Asp Val Ser Glu Asn Ala Asn Arg Ser Gly Phe Lys 



145 150 155 



160' 



15 Asn Ile Arg Asp Val Leu Asn Leu Asn Phe Gly Asn Leu Glu Ala Arg 

165 no i 7 5 

Ser Gin Gin Thr Thr Asp Ile Thr Gly lie Ala Thr Gly Tyr Arcr Asd 
180 iss 190 



20 



Leu Asp His Met Thr Thr Gly Leu His Glu Glu Glu Leu Ile Ile Leu 
195 200 205 



Ala Ala Arg Pro Ala Val Gly Lys Thr Ala Phe Ala Leu Asn He Ala 
^ 210 215 220 

Gin Asn Ile Gly Thr Lys Leu Asp Lys Thr Val Ala Ile Phe Ser Leu 
225 230 235 240 

30 Glu Met Gly Ala Glu Ser Leu Val Asp Arg Met Leu Ala Ala Glu Gly 

245 250 -255 

Leu Val Glu Ser His Ser Ile Arg Thr Gly Gin Leu Thr Asp Glu Glu 
260 • 265 270 

Trp Gin Lys Tyr Thr Ile Ala Gin Gly Asn Leu Ala Asn Ala Ser Ile 
275 280 285 



35 



4ft lln ASP ASP Thr Pr ° Gly Ile Ar 9 Ile Thr Glu He Arg Ser Arg 

W 290 . 295 300 

Ser Arg Lys Leu Ala Gin Glu Thr Gly Asn Leu Gly Leu Ile Val Ile 
305 310 •aic 

J±u 315 320 

45 Asp Tyr Leu Gin Leu Ile Thr Gly Thr Gly Arg Glu Asn Arg Gin Gin 

325 ' 330 335 

Glu Val Ser Glu Ile Ser Arg Gin Leu-Lys Ile Leu Ala Lys Glu Leu 
50 340 345 350 

Lys Val Pro Val He Ala Leu Ser Gin Leu Ser Arg Gly Val Glu Gin 
355 360 365 

« Ar9 Asp Lys Ar 9 Pro Val Le u Ser Asp Ile Arg Glu Ser Gly Ser 
370 375 3 8 o 



Ile Glu Gin Asp Ala Asp Ile Val Ala Phe Leu Tyr Arg Asp Asp 



Tyr 
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385 390 395 400 

Tyr Glu Arg Gly Gly Glu Glu Glu Glu Gly He Pro Asn Asn Lys Val 
5 405 410 4l5 

Glu Val He He Glu Lys Asn Arg Ser Gly Ala Arg Gly* Thr Val Glu 
420 425 430 

■ Leu He Val Gin Lys Glu Tyr Asn Lys Phe Ser Ser He Ser Lys Arq 
1U 435 440 445 

Glu Ala 
450 



15 



20 



<210> 203 
<211> 699 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 203 



Met Ala Thr Ala Thr Lys Lys Lys Lys Ser Thr Val Lys Lys Asn Leu 
1 5 . io 15 

25 Val He Val Glu Ser Pro Ala Lys Ala Lys Thr He Glu Lys Tyr Leu 
20 25 30 - 

Gly Arg Asn Tyr Lys Val Leu Ala Ser Val Gly His He Arg Asp Leu 
35 40 45 

30 q ° 

Lys Lys Ser Ser Met Ser Val Asp He Glu Asn Asn Tyr Glu Pro Gin 
5 <> " 55 60 

Tyr He Asn He Arg Gly Lys Gly Pro Leu He Asn Asp Leu Lys Lys 
■« 65 70 75 i~ 



80 



Glu Ala Lys Lys Ala Asn Lys Val Phe Leu Ala Ser Asp Pro Asp Ara 
85 90 95 y 

40 Glu Gly Glu Ala He Ser Trp His Leu Ala His He Leu Asn Leu Asp 
100 105 110 

Glu Asn Asp Ala Asn Arg Val Val Phe Asn Glu He Thr Lys Asp Ala 
45 115 120 ' 125 

Val Lys Asn Ala Phe Lys Glu Pro Arg Lys He Asp Met Asp Leu Val 
130 135 140 

Asp Ala Gin Gin Ala Arg Arg He Leu Asp Arg Leu Val Gly Tyr Ser 
50 145 150 155 leo 

He Ser Pro He Leu Trp Lys Lys Val Lys Lys Gly Leu Ser Ala Gly 
165 170 175 

55 Arg Val Gin Ser He Ala Leu Lys Leu He He Asp Arg Glu Asn Glu 
180 185 190 
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He Asn Ala Phe Gin Pro Glu Glu Tyr Trp Thr Val Asp Ala Val Phe 
195 200 ' 205 

Lys Lys Gly Thr Lys Gin Phe His Ala Ser Phe Tyr Gly Val Asp Gly 
-> 210 215 220 

Lys Lys Met Lys Leu Thr Ser Asn Asn Glu Val Lys Glu Val Leu Ser 
225 230 235 240 

10 Arg Leu Thr Ser Lys Asp Phe Ser Val Asp Gin Val Asp Lys Lys Glu 

245 250 255 



15 



Arg Lys Arg Asn Ala Pro Leu Pro Tyr Thr Thr Ser Ser Met Gin Met 
260 265 270 

Asp Ala Ala Asn Lys He Asn Phe Arg Thr Arg Lys Thr Met Met Val 
275 280 285 

Ala Gin Gin Leu Tyr Glu Gly He Asn He Gly Ser Gly Val Gin Gly 
20 290 295 300 

Leu He Thr Tyr Met Arg Thr Asp Ser Thr Arg He Ser Pro Val Ala 
305 310 315 320 

25 Gin Asn Glu Ala Ala Ser Phe He Thr Asp Arg Phe Gly Ser Lys Tyr 

325 330 335 



30 



Ser Lys His Gly Ser Lys Val Lys Asn Ala Ser Gly Ala Gin Asp Ala 
340 345 350 

His Glu Ala He Arg Pro Ser Ser Val Phe Asn Thr Pro Glu Ser He 
355 360 365 



Ala Lys Tyr Leu Asp Lys Asp Gin Leu Lys Leu Tyr Thr Leu He Trp 
55 370 375 380 

Asn Arg Phe Val Ala Ser Gin Met Thr Ala' Ala Val Phe Asp Thr Met 
385 3 90 . 395 400 

40 Ala Val Lys Leu Ser Gin Lys Gly Val Gin Phe Ala Ala Asn Gly Ser 

405 410 415 

Gin Val Lys Phe Asp Gly Tyr Leu Ala He Tyr Asn Asp Ser Asp Lys 
45 " 420 425 4 30 

Asn Lys Met Leu Pro Asp Met Val Val Gly Asp Val Val Lys Gin Val 
435 440 445 

Asn Ser Lys Pro Glu Gin His Phe Thr Gin Pro Pro Ala Arg Tvr Ser 
50 450 .455 .460 • 

Glu Ala Thr Leu He Lys Thr Leu Glu Glu Asn Gly Val Gly Arg Pro 
465 470 475 • 480 

55 Ser Thr Tyr Ala Pro Thr He Glu Thr He Gin Lys Arg Tyr Tyr Val 

485 490 495 
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Arg Leu Ala Ala Lys Arg Phe Glu Pro Thr Glu Leu Gly Glu lie Val 
500 505 510 

sn Lys Leu lie Val Glu Tyr Phe Pro Asp lie Val Asn Val Thr Phe 
515 5 2o 525 

Thr Ala Glu Met Glu Gly Lys Leu Asp Asp Val Glu Val Gly Lys Glu 
SJU 535 540 

Gin Trp Arg Arg Val lie Asp Ala Phe Tyr Lys Pro Phe Ser Lys Glu 
■ 550 55 5 560 

Val Ala Lys Ala Glu Glu Glu Met Glu Lys He Gin He L ys Asp Glu 
565 570 57 £ 

Pro Ala Gly Phe Asp Cys Glu Val Cys Gly Ser Pro Met Val He Lys 
580 585 590 *■ 

20 Leu Gly Arg Phe Gly Lys Phe Tyr Ala Cys Ser Asn Phe Pro Asp Cys 
595 soo — J 



605 



25 



Arg His Thr Gin Ala He Val Lys Glu lie Gly Val Glu Cys Pro Ser 
e±u gj5 



620 



Cys His Gin Gly Gin lie lie Glu Arg Lys Thr Lys Arg Asn Arg Leu 
25 630 635 640 

Phe Tyr Gly Cys Asn Arg Tyr Pro Glu Cys Glu Phe Thr Ser Trp Asp 
645 650 65 £ 

Lys Pro Val Gly Arg Asp Cys. Pro Lys Cys Gly Asn Phe Leu Met Glu 
660 6 65 670 

35 Lys Lys Val Arg Gly Gly Gly Lys Gin Val Val Cys Ser Lys Gly Asp 
675 680 -685 

Tyr Glu Glu Glu Lys Met Ala Leu Cys Gin Leu 

40 

<210> 204 
<211> 326 
<212> PRT 
45 <213> Streptococcus pneumoniae 

<400> 204 

Met Phe lie Ser He Ser Ala Gly Ile~ Val Thr Phe Leu Leu Thr Leu 
50 5 10 15 

Val Gly He Pro Ala Phe He Gin Phe Tyr Arg Lys Ala Gin .He Thr 
20 25 30 

^ Gly Gin Gin Met His Glu Asp Val Lys Gin His Gin Ala Lys Ala Gly 
35 40 .45 

Thr Pro Thr Met Gly Gly Leu Val Phe Leu lie Thr Ser Val Leu Val 
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50 55 60 

Ala Phe Phe Phe Ala Leu Phe Ser Ser Gin Phe Ser Asn Asn Val Glv 
5 65 70 75 Q l 

Met lie Leu Phe He Leu Val Leu Tyr Gly Leu Val Gly Phe Leu Asp 
85 9 o 95 

Asp Phe Leu Lys Val Phe Arg Lys He Asn Glu Gly Leu Asn Pro Lys 
10 100 105 no 

Gin Lys Leu Ala Leu Gin Leu Leu Gly Gly Val He Phe Tyr Leu Phe 
115 120 125 

15 Tyr Glu Arg Gly Gly Asp Met Leu Ser Val Phe Gly Tyr Gin Val His 
130 135 140 

Leu Gly He Phe Tyr He Val Phe Ala Leu Phe Trp Leu Val Gly Phe 
2Q 145 150 155 160 

Ser Asn Ala Val Asn Leu Thr Asp Gly Val Asp Gly Leu Ala Ser He 
165 . 170 175 

Ser Val Val He Ser Leu Ser Ala Tyr Gly Val He Ala Tyr Val Gin 
^ 5 180 185 190 

Gly Gin Met Asp He Leu Leu Val He Leu Ala Met He Gly Gly Leu 
195 200 205 

30 Leu Ser Phe Phe lie Phe Asn His Lys Pro Ala Lys He Phe Met Glv 
210 215 220 

Asp Val Gly Ser Leu Ala Leu Gly Gly Met Leu Ala Ala He Ser Met 
35 225 230 235 240 

Ala Leu His Gin Glu Trp Thr Leu Leu He He Gly He Val Tyr Val 
245 250 255 

Phe Glu Thr Thr Ser Val Met Met Gin Val Ser Tyr Phe Lys Leu Thr 
W 260 265 270 

Gly Gly Lys Arg lie Phe Arg Met Thr Pro Val His His His Phe Glu- 
275 280 285 

45 Leu Gly Gly Leu Ser Gly Lys Gly Asn Pro Trp Ser Glu Trp Lys Val 
290 295 300 

Asp Phe Phe Phe Trp Gly Val Gly Leu teu Ala Ser Leu Leu Thr Leu 
5Q 305 310 315 320 

Ala He Leu Tyr Leu Met 
325 

55 <210> 205 
<211> 693 
<212> PRT 
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<213> Streptococcus pneumoniae 
<400> 205 

Met Ala Arg Glu Phe Ser Leu Glu Lys Thr Arg Asn lie Gly lie Met 
1 5 10 



15 



Ala His Val Asp Ala Gly. Lys Thr Thr Thr Thr Glu Arg lie Leu Tyr 
20 25 30 

10 Tyr Thr Gly Lys lie His Lys lie Gly Glu Thr His Glu Gly Ala Ser 
35 40 45 



15 



20 



Gin Met Asp Trp Met Glu Gin Glu Gin Glu Arg Gly He Thr He Thr 
50 55 . 6 o 

Ser Ala Ala Thr Thr Ala Gin Trp Asn Asn His Arg Val Asn lie lie 
65 70 .75 so 

Asp Thr Pro Gly. His Val Asp Phe Thr lie Glu Val Gin Arg Ser Leu 
85 90 95 

Arg Val Leu Asp Gly Ala Val Thr Val Leu Asp Ser Gin Ser Gly Val 
100 105 110 

25 Glu Pro Gin Thr Glu Thr Val Trp Arg Gin Ala Thr Glu Tyr Gly Val 
115 120 125 

Pro Arg lie Val Phe Ala Asn Lys Met Asp Lys He Gly Ala Asp Phe 
30 130 135 140 

Leu Tyr Ser Val Ser Thr Leu His Asp Arg Leu Gin Ala Asn Ala His 
145 " 150 155 160 

Pro He Gin Leu Pro He Gly Ser Glu Asp Asp Phe Arg Gly He He 
165 ivo 175 

Asp Leu He Lys Met Lys Ala Glu He Tyr Thr Asn Asp Leu Gly Thr 
180 185 i9o 

40 Asp He Leu Glu Glu Asp lie Pro Ala Glu Tyr Leu Asp Gin Ala Gin 
195 200 205 

Glu Tyr Arg Glu Lys Leu He Glu Ala Val Ala Glu Thr Asp Glu Glu 
45 210 215 220 

Leu Met Met Lys Tyr Leu Glu Gly Glu Glu He Thr Asn Glu Glu Leu 
225 230 235 24Q 

Lys Ala Gly He Arg Lys Ala Thr He Asn Val Glu Phe Phe Pro Val 
3U 245 250 255 

Leu Cys Gly Ser Ala Phe Lys Asn Lys Gly Val Gin Leu Met Leu Asp 
260 265 270 

55 Ala Val lie Asp Tyr Leu Pro Ser Pro Leu Asp He Pro Ala He Lys 
275 280 285 
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Gly lie Asn Pro Asp Thr Asp Ala Glu Glu He Arg Pro Ala Ser Asp 

290 295 300 

Glu Glu Pro Phe Ala Ala Leu Ala Phe Lys He Met Thr Asp Pro Phe 



310 315 



320 



Val Gly Arg Leu Thr Phe Phe Arg Val Tyr Ser Gly Val Leu Gin Ser 
325 330 335 

10 Gly Ser Tyr Val Leu Asn Thr Ser Lys Gly Lys Arg Glu Ara He Glv 

340 345 350 



15 



Arg He Leu Gin Met His Ala Asn Ser Arg Gin Glu He Asp Thr Val 
355 360 365 

Tyr Ser Gly Asp He Ala Ala Ala Val Gly Leu Lys Asp Thr Thr Thr 
370 375 380 



Gly Asp Ser Leu Thr Asp Glu Lys Ala Lys He He Leu Glu Ser He 
20 385 390 395 400 

Asn Val Pro Glu Pro Val He Gin Leu Met Val Glu Pro Lys Ser Lys 
405 410 * 415 

25 Ala Asp Gin Asp Lys Met Gly He Ala Leu Gin Lys Leu Ala Glu Glu 
420 425 430 



30 



Asp Pro Thr Phe Arg Val Glu Thr Asn Val Glu Thr Gly Glu Thr Val 
435 440 445 

lie Ser Gly Met Gly Glu Leu His Leu Asp Val Leu Val Asp Arq Met 
450 '455 460 



Arg Arg Glu Phe Lys Val Glu Ala Asn Val Gly Ala Pro Gin Val Ser 

480 



35 465 470 475 



Tyr Arg Glu Thr Phe Arg Ala Ser Thr Gin Ala Arg Gly Phe Phe Lys 
485 490 495 

40 Arg Gin Ser Gly Gly Lys Gly Gin Phe Gly Asp Val Trp He Glu Phe 
500 505 sio 

Thr Pro Asn Glu Glu Gly Lys Gly Phe Glu Phe Glu Asn Ala He Val 
45 515 520 525 

Gly Gly Val Val Pro Arg Glu Phe He Pro Ala Val Glu Lys Gly Leu 
530 535 540 

Val Glu Ser Met Ala Asn Gly Val Leu Ala Gly Tyr Pro Met Val Asd 
50 545 550 555 560 

Val Lys Ala Lys Leu Tyr Asp Gly Ser Tyr His Asp Val Asp Ser Ser 
565 570 ■ 575 

55 Glu Thr Ala Phe Lys lie Ala Ala Ser Leu Ser Leu Lys Glu Ala Ala 
580 585 590 
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Lys Ser Ala Gin Pro Ala He Leu Glu Pro Met Met Leu Val Thr He 
595 600 605 

Thr Val Pro Glu Glu Asn Leu Gly Asp Val Met Gly His Val Thr Ala 
5 610 615 620 

Arg Arg Gly Arg Val Asp Gly Met Glu Ala His Gly Asn Ser Gin He 
625 - 630 635 640 

10 Val Arg Ala Tyr Val Pro Leu Ala Glu Met Phe Gly Tyr Ala Thr Val 

645 650 655 



15 



30 



40 



Leu Arg Ser Ala Ser Gin Gly Arg Gly Thr Phe Met Met Val Phe Asp 
660 665 670 

His Tyr Glu Asp Val Pro Lys Ser Val Gin Glu Glu He He Lys Lys 
675 680 685 



Asn Lys Gly Glu Asp 
20 690 



<210> 206 
<211> 408 
25 <212> PRT 

<213> Streptococcus pneumoniae 



<400> 206 

Met Pro Asn Tyr Asn He Pro Phe Ser Pro Pro Asp He Thr Glu Ala 
1 5 10 15 

Glu lie Ala Glu Val Ala Asp Thr" Leu Arg Ser Gly Trp He Thr Thr 
20 25 30 

35 Gly Pro Lys Thr Lys Glu Leu Glu Arg Arg Leu Ser Leu Tyr Thr Gin 
35 40 45 



Thr Pro Lys Thr Val Cys Leu Asn Ser Ala Thr Ala Ala Leu Glu Leu 
50 55 60 

He Leu Arg Val Leu Glu Val Gly Pro Gly Asp Glu Val He Val Pro 

65 70 75 80 



Ala Met Thr Tyr Thr Ala Ser Cys Ser Val He Thr His Val Gly Ala 
45 85 90 95 

Thr Pro Val Met Val Asp He Gin Ala Asp Thr Phe Glu Met Asp Tyr 
100 105 - no 

50 Asp Leu Leu Glu Gin Ala He Thr Glu Lys Thr Lys Val He He Pro 
H5 120 125 

Val Glu Leu Ala Gly He Val Cys Asp Tyr Asp Arg Leu Phe Gin Val 
130 135 140 

55 

Val Glu Lys Lys Arg Asp Phe Phe Thr Ala Ser Ser Lys Trp Gin Lys 
145 150 155 160 
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Ala Phe Asn Arg He Val He Val Ser Asp Ser Ala His Ala Leu Gly 
165 no 175 

5 Ser Thr Tyr Lys Gly Gin Pro Ser Gly Ser He Ala Asp Phe Thr Ser 
180 185 190 

Phe Ser Phe His Ala Val Lys Asn Phe Thr Thr Ala Glu Gly Gly Ser 
1Q 195 200 205 

Ala Thr Trp Lys' Ala -Asn Pro Val He Asp Asp Glu Glu Met Tvr Lvs 
210 ■ 215 220 



Glu Phe Gin He Leu Ser Leu His Gly Gin Thr Lys Asp Ala Leu Ala 

240 



15 225 230 235 



Lys Met Gin Leu Gly Ser Trp Glu Tyr Asp He Val Thr Pro Ala Tvr 
245 250 255 

20 Lys Cys Asn Met Thr Asp He Met Ala Ser Leu Gly Leu Val Gin Leu 
260 265 270 

Asp Arg Tyr Pro Ser Leu Leu Gin Arg Arg Lys Asp He Val Asp Ara 
25 , 275 280 285 

Tyr Asp Ser Gly Phe Ala Gly Ser Arg He His Pro Leu Ala His Lvs 
290 295 300 

Thr Glu Thr Val Glu Ser Ser Arg His Leu Tyr He Thr Arg Val Glu 
30 305 310 315 320 

Gly Ala Ser Leu Glu Glu Arg Ser Leu He He Gin Glu Leu Ala Lys 
325 330 335 

35 Ala Gly He Ala Ser Asn Val His Tyr Lys Pro Leu Pro Leu Leu Thr 
340 345 350 

Ala Tyr Lys Asn Leu Gly Phe Asp Met Thr Asn Tyr Pro Lys Ala Tyr 
4Q 355 360 365 

Ala Phe Phe Glu Asn Glu lie Thr Leu Pro Leu His Thr Lys Leu Ser 
370 375 380 



Asp Glu Glu Val Asp Tyr He He Glu Thr Phe Lys Thr Val Ser Glu 

400 



45 385 390 395 



Lys Val Leu Thr Leu Ser Lys Lys 
405 

50 

<210> 207 
<211> 325 
<212> PRT 
^ <213> Streptococcus pneumoniae 

<400> 207 

Met Thr Glu Pro Asp Phe Trp Asn Asp Asn He Ala Ala Gin Lys Thr 
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10 



15 



Ser Gin Glu Leu Asn Val Phe Lys Asn Thr Tyr Asn Thr Phe His Lvs 

20 25 30 

Met Glu Glu Leu Gin Asp Glu Val Glu He Leu Leu Asp Phe Leu Ala 

35 40 45 



10 



Glu Asp Glu Ser Val His Asp Glu Leu Val Ala Gin Leu Ala Glu Leu 
50 55 60 



Asp Lys He Met Thr Ser Tyr Glu Met Thr Leu Leu Leu Ser Glu Pro 

65 " ™ ' 75 eo 

15 Tyr Asp His Asn Asn Ala He Leu Glu He His Pro Gly Ser Glv Glv 

85 90 95 



20 



25 



Thr Glu Ala Gin Asp Trp Gly Asp Met Leu Leu Arg Met Tyr Thr Arq 
100 105 no 

Tyr Gly Asn Ala Lys Gly Phe Lys Val Glu Val Leu Asp Tyr Gin Ala 
115 120 125 

Gly Asp Glu Ala Gly He Lys Ser Val Thr Leu Ser Phe Glu Gly Pro 



130 



135 



140 



Asn Ala Tyr Gly Leu Leu Lys Ser Glu Met Gly Val His Arg Leu Val 
145 150 155 160 

30 Arg He Ser Pro Phe Asp Ser Ala Lys Arg Arg His Thr Ser Phe Thr 

165 170 175 

Ser Val Glu Val Met Pro Glu Leu Asp Asp Thr He Glu Val Glu He 
180 185 190 



35 



Arg Glu Asp Asp He Lys Met Asp Thr Phe Arg Ser Gly Gly Ala Glv 
195 200 205 



Gly Gin Asn Val Asn Lys Val Ser Thr Gly Val Arg Leu Thr His lie 
4U 210 215 



220 



Pro Thr Gly He Val Val Gin Ser Thr Val Asp Arg Thr Gin Tyr Gly 
225 230 235 



240 



45 Asn Arg Asp Arg Ala Met Lys- Met Leu Gin Ala Lys Leu Tyr Gin Met 

245 250 255 



50 



Glu Gin Glu Lys Lys Ala Ala Glu Val- -Asp Ser Leu Lys Gly Glu Lys 
260 265 270 

Lys Glu He Thr Trp Gly Ser Gin He Arg Ser Tyr Val Phe Thr Pro 
275 280 285 



Tyr Thr Met Val Lys Asp His Arg Thr Ser Phe Glu Val Ala Gin Val 
290 295 



300 



Asp Lys Val Met Asp Gly Asp Leu Asp Gly Phe He Asp Ala Tyr Leu 
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305 3 10 315 320 

Lys Trp Arg He Ser 
325 

5 

<210> 208 
<211> 249 
<212> PRT 
10 <213> Streptococcus pneumoniae 

<400> 208 

Met Phe Tyr Thr Tyr Leu Arg Gly Leu Val Val Leu Leu Leu Trp Ser 

15 15 10 

He Asn Gly Asn Ala His Tyr His Asn Thr Asp Lys He Pro Asn Gin 
20 25 30 

Asp Glu Asn Tyr He Leu Val Ala Pro His Arg Thr Trp Trp Asp Pro 
ZU 35 40 45 

Val Tyr Met Ala Phe Ala Thr Lys Pro Lys Gin Phe He Phe Met Ala 
50 55 60 

25 Lys Lys Glu Leu Phe Thr Asn Arg He Phe Gly Trp Trp lie Arg Met 
65 70 75 80 

Cys Gly Ala Phe Pro He Asp Arg Glu Asn Pro Ser Ala Ser Ala He 
30 85 . 90 95 

Lys Tyr Pro He Asn Val Leu Lys Lys Ser Asp Arg Ser Leu He Met 
100 105 no 

Phe Pro Ser Gly Ser Arg His - Ser Asn Asp Val Lys Gly Gly Ala Ala 
• 3 115 120 125 

Leu He Ala Lys Met Ala Lys Val Arg He Met Pro Val Thr Tyr Thr 
13 0 135 140 

40 Gly Pro Met Thr Leu Lys Gly Leu He Ser Arg Glu Arg Val Asp Met 
145 . 150 155 i 6 o 



Asn Phe Gly Asn Pro He Asp He Ser Asp He Lys Lys Met Asn Asd 
165 nn ^ ^ 

45 



165 170 175. 



Glu Gly He Glu Thr Val Ala Asn Arg He Gin Thr Glu Phe Gin Arg 

180 185.... 190 

Leu Asp Glu Glu Thr Lys Gin Trp His Asn Asp Lys Lys Pro Asn Pro 
30 195 200 205 

Leu Trp Trp Phe He Arg He Pro Ala Leu He Leu Ala He lie Leu 
210 215 220 

55 Ala He Leu Thr He He Phe Ser Phe He Ala Ser Phe He Trp Asn 
225 230 235 240 
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Pro Asp Lys Lys Arg Glu Glu Leu Ala 
245 



5 <210> 209 
<211> 1033 
<212> PRT 

<213> Streptococcus pneumoniae 



10 <400> 209 

Met lie Ala Gin Leu Asp Thr Lys Thr Val Tyr Ser Phe Met Glu Ser 
1 5 10 



15 



Val lie Ser lie Glu Lys Tyr Val Arg Ala Ala Lys Glu Tyr Gly Tyr 
20 25 30 

Thr His Leu Ala Met Met Asp He Asp Asn Leu Tyr Gly Ala Phe Asd 
35 40 45 F 

20 Phe Leu Glu He Thr Lys Lys Tyr Gly lie. His Pro Leu Leu Gly Leu 
50 55 6 o 

Glu Met Thr Val Phe Val Asp Asp Gin Gly Val Asn Leu Arg Phe Leu 

25 70 75 80 

Ala Leu Ser Ser Val Gly Tyr Gin Gin Leu Met Lys Leu Ser Thr Ala 
85 90 95 

Lys Met Gin Gly Glu Lys Thr Trp Ser Val Leu Ser Gin Tyr Leu Glu 
JU 100 105 110 

Asp He Ala Val He Val Pro Tyr Phe Asp Arg Val Glu Ser Leu Glu 
115 , 120 125 

35 Leu Gly Cys Asp Tyr Tyr He Gly Val Tyr Pro Glu Thr Leu Ala Ser 
130 135 140 

Glu Phe His His Pro He Leu Pro Leu Tyr Arg Val Asn Ala Phe Glu 

40 150 ' 155 160 

Ser Arg Asp Arg Glu Val Leu Gin Val Leu Thr Ala He Lys Glu Asn 
165 170 175 

Leu Pro Leu Arg Glu Val Pro Leu Arg Ser Arg Gin Asp Val Phe He 
n 180 185 190 

Ser Ala Ser Ser Leu Glu Lys Leu Phe Gin Glu Arg Phe Pro Gin Ala 
195 200 ' 205 

50 Leu Asp Asn Leu Glu Lys Leu He Ser Gly He Ser Tyr Asp Leu Asp 
210 215 220 



Thr Ser Leu Lys Leu Pro Arg Phe Asn Pro Ala Arg Pro Ala Val Glu 
55 230 235 240 



Glu Leu Arg Glu Arg Ala Glu Leu Gly Leu Val Gin Lys Gly Leu Thr 
245 250 255 
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Ser Lys Glu Tyr Gin Asp Arg Leu Asp Gin Glu Leu Ser Val He His 
260 265 270 

Asp Met Gly Phe Asp Asp Tyr Phe Leu Val Val Trp Asp Leu Leu Arq 
275 280 285 



10 



15 



20 



Phe Gly Arg Ser Asn Gly Tyr Tyr Met Gly Met Gly Arg Gly Ser Ala 
290 295 300 

Val Gly Ser Leu Val Ser Tyr Ala Leu Asp He Thr Gly He Asp Pro 
305 310 315 3 2 o 

Val Glu Lys Asn Leu lie Phe Glu Arg Phe Leu Asn Arg Glu Arq Tvr 
325 330 335 

Thr Met Pro Asp He Asp He Asp He Pro Asp He Tyr Arg Pro Asp 
340 345 350 

Phe He Arg Tyr Val Gly Asn Lys Tyr Gly Ser Lys His Ala Ala Gin 
355 360 365 



-25 



30 



35 



He Val Thr Phe Ser Thr Phe Gly Ala Lys Gin Ala Leu Arg Asp Val 
370 . 375 380 

Leu Lys Arg Phe Gly Val Pro Glu Tyr Glu Leu Ser Ala He Thr Lys 
385 - 390 395 4 00 

Lys He Ser Phe Arg Asp Asn Leu Lys Ser Ala Tyr Glu Gly Asn Leu 
405 410 415 

'4 

Gin Phe Arg Gin Gin lie Asn Ser Lys Leu Glu Tyr Gin Lys Ala Phe 
420 425 430 

Glu He Ala Cys Lys He Glu Gly Tyr Pro Arg Gin Thr Ser Val His 
435 440 445 



40 



45 



Ala Ala Gly Val Val lie Ser Asp Gin Asp Leu Thr Asn Tyr He Pro 
450 455 460 

Leu Lys Tyr Gly Asp Glu lie Pro Leu Thr Gin Tyr Asp Ala His Gly 
465 470 475 480 

Val Glu Ala Ser Gly Leu Leu Lys Met Asp Phe Leu Gly Leu Arg Asn 
485 490 495 



50 



Leu Thr Phe Val Gin Lys Met Gin Glu Leu Leu Ala Glu lie Glu Gly 
500 505 - - 510 

He His Leu Lys lie Glu Glu lie Asp Leu Glu Asp Lys Glu Thr Leu 
515 520 525 



55 



Asp Leu Phe Ala Ser Gly Asn Thr Lys Gly lie Phe Gin Phe Glu Gin 
530 535 540 

Pro Gly Ala He Arg Leu Leu Lys Arg. Val Gin Pro Val Cys Phe Glu 

545 550 555 560 
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Asp Val Val Ala Thr Thr Ser Leu Asn Arg Pro Gly Ala Ser Asd Tvr 
565 570 57 f 

5 He Asn Asn Phe Val Ala Arg Lys His Gly Gin Glu Glu Val Thr Val 
580 585 590 



10 



Leu Asp Pro Val Leu Glu Asp He Leu Ala Pro Thr Tyr Gly He Met 
595 600 605 

Leu Tyr Gin Glu Gin Val Met Gin Val Ala Gin Arg Phe Ala Gly Phe 
61 Q 615 620 



15 625 



Ser Leu Gly Lys Ala Asp He Leu Arg Arg Ala Met Gly Lys Lys Asp 



630 



635 



64 0 



Ala Ser Ala Met His Glu Met Arg Ala Ser Phe lie Gin Gly Ser Leu 

645 650 655 

20 Glu Ala Gly His Thr Val Glu Lys Ala Glu Gin Val Phe Asp Val Met 
660 665 670 



25 



Glu Lys Phe Ala Gly Tyr Gly Phe Asn Arg Ser His Ala Tyr Ala Tyr 
675 680 685 

Ser Ala Leu Ala Phe Gin Leu Ala Tyr Phe Lys Thr His Tyr Pro Ala 
690 695 700 



He Phe Tyr Gin He Met Leu Asn Ser Ala Asn Ser Asp Tyr Leu II 
30 ? 05 710 715 no 



720 



Asp Ala Leu Glu Ala Gly Phe Glu Val Ala* Pro Leu Ser He Asn Thr 
725 730 735 

35 He Pro Tyr His- Asp Lys He Ala Asn Lys Ala He Tyr Leu Gly Leu 
740 745 750 



40 



Lys Ser He Lys Gly Val Ser Asn Asp Leu Ala Leu Trp He He Glu 
75 5 760 765 

His Arg Pro Tyr Ser Asn He Glu Asp Phe He Ala Lys Leu Pro Glu 
770 775 780 



Asn Tyr Leu Lys Leu Pro Leu Leu Glu Pro Leu Val Lys Val Glv Leu 
45 785 790 795 y 800 

Phe Asp Ser Phe Glu Lys Asn Arg Gin Lys Val Phe Asn Asn Leu Ala 
805 '810 815 

50 Asn -Leu Phe Glu Phe Val Lys Glu Leu Gly Ser Leu Phe Gly Asp Ala 
820 825 830 



55 



He Tyr Ser Trp Gin Glu Ser Glu Asp Trp Thr Glu Gin Glu Lys Phe 
835 840 845 

Tyr Met Glu Gin Glu Leu Leu Gly lie Gly Val Ser Lys His Pro Leu 
850 855 860 
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Gin Ala lie Ala Ser Lys Ala He Tyr Pro He Thr Pro He Glv Asn 
865 8 ™ 875 880 

5 Leu Ser Glu Asn Ser Tyr Ala He He Leu Val Glu Val Gin Lys He 
885 8 9o . 8 ^ 5 

Lys Val He Arg Thr Lys Lys Gly Glu Asn Met Ala Phe Leu Gin Ala 
10 900 905 910 

Asp Asp Ser Lys Lys Lys Leu Asp Val Thr Leu Phe Ser Asp- Leu Tyr 
915 920 925* 

Arg Gin Val Gly Gin Glu He Lys Glu Gly Ala Phe Tyr Tyr Val Lys 
A -> 930 935 940 

Gly Lys He Gin Ser Arg Asp Gly Arg Leu Gin Met He Ala Gin Glu 
945 950 955 960 

20 He Arg Glu Ala Val Ala Glu Arg Phe Trp He Gin Val Lys Asn His 

965 970 975 



25 



35 



40 



50 



Glu Ser Asp Gin Glu lie Ser Arg lie Leu Glu Gin Phe Lys Gly Pro 
980 985 990 

lie Pro Val He lie Arg Tyr Glu Glu Glu Gin Lys Thr He Val Ser 
. 995 1000 1005 



Pro His His Phe Val Ala Lys Ser Asn Glu Leu Glu Glu Lys Leu Asn 
30 1010 . 1015 1020 



Glu He Val Met Lys Thr He Tyr Arg 
1025 1030 



<210> 210 
<211> 306 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 210 



Met Thr Asn Glu Phe Leu His Phe Glu Lys He Ser Arg Gin Thr Tro 
1 5 io 15 

45 Gin Ser Leu His Arg Lys Thr Thr Pro Pro Leu Thr Glu Glu Glu Leu 
20 25 30 



Glu Ser He Lys Ser Phe Asn Asp Gin lie Ser Leu Gin Asp Val Thr 
35 40 45 

Asp lie Tyr Leu Pro Leu Ala His Leu lie Gin lie Tyr Lys Arq Thr 
50 55 eo 



Lys Glu Asp Leu Ala Phe Ser Lys Gly He Phe Leu Gin Arg Glu Ser 
55 65 70 75 80 

Lys Ser Gin Pro Phe He lie Gly Val Ser Gly Ser Val Ala Val Gly 
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85 90 95 

Lys Ser Thr Thr Ser Arg Leu Leu Gin He Leu Leu Ser Arg Thr Phe 
5 100 105 uo- 

Thr Asp Ala Thr Val Glu Leu Val Thr Thr Asp Gly Phe Leu Tyr Pro 
115 120 125 

Asn Gin Thr Leu He Glu Gin Gly He Leu Asn Arg Lys Gly Phe Pro 
1U 130 135 ■ 140 

Glu Ser Tyr Asp Met Glu Ala Leu Leu Asn Phe Leu Asp Arq He Lvs 
145 150 155 do 

15 Asn Gly Gin Asp Val Asp He Pro Val Tyr Ser His Glu Val Tyr Asp 

165 170 175 

He Val Pro Lys Lys Lys Gin Ser Val Lys Ala Ala Asp Phe Val He 
2Q 180 185 i 9 o 



Val Glu Gly lie Asn Val Phe Gin Asn Pro Gin Asn Asp Arg Leu Tvr 
195 200 205 

He Thr Asp Phe Phe Asp Phe Ser lie Tyr Val Asp Ala Gly Val Asd 



25 210 215 220 

Asp lie Glu Ser Trp Tyr Leu Asp Arg Phe Leu Lys Met Leu Ser Leu 
225 230 235 240 

30 Ala Gin Asn Asp Pro Asp Ser Tyr Tyr Tyr Arg Phe Thr Gin Met Pro 
. 245 250 255 

He Gly Glu Val Glu Ser Phe Ala His Gin Val Trp Thr Ser He Asn 
260 265 270 

Leu Thr Asn Leu Gin Asn Tyr He Glu Pro Thr Arg Asn Arg Ala Glu 
' 275 280 285 

Val He Leu His Lys Ser Lys Asn His Glu He Asp Glu He Tvr Leu 
4U 290 295 



35 



300 



45 



50 



Lys Lys 
305 



<210> 211 
<211> 246 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 211 

Met Glu He Ser Leu Leu Thr Asp Val Gly Gin Lys Arg Thr Asn Asn 

1 5 10 15 . • 

55 Gin Asp Tyr Val Asn His Tyr Val Asn Arg Ala Gly Arg Thr Met He 
20 25 30 



164 



WO 01/49721 



PCT/USOO/35604 



lie Leu Ala Asp Gly Met Gly Gly His Arg Ala Gly Asn lie Ala Ser 
35 40 45 

Glu -Met Ala Val Thr Asp Leu Gly Val Ala Trp Val Asp Thr Gin lie 

50 55 . 60 

Asp Thr Val Asn Glu Val Arg Glu Trp Phe Ala His Tyr Leu Glu He 
65 70 75 80 

Glu Asn Gin Lys He His Gin Leu Gly Gin Asp Glu Ala Tyr Arg Gly 
85 90 95 

Met Gly Thr Thr Leu Glu Val Leu Ala He He Asp Asn Gin Ala He 
100 105 110 

Tyr Ala His He Gly Asp Ser Arg He Gly Leu He Arg Gly Glu Glu 
115 120 125 

Tyr His Gin Leu Thr Ser Asp His Ser Leu Val Asn Glu Leu Leu Lys 
130 135 140 

Ala Gly Gin Leu Thr Pro. Glu Glu Ala Glu Ala His Pro Gin Lys Asn 
145 150 155 • 160 

He He Thr Gin Ser He Gly Gin Lys Asp Glu He Gin Pro Asp Phe 
165 170 175 

Gly Thr Val He Leu Glu Ser Gly Asp Tyr. Leu Leu Leu Asn Ser Asp 
180 185 190 

Gly Leu Thr Asn Met He Ser Gly Ser Glu He Arg Asp He Val Thr 
195 200 205 

Ser Asp He Pro Leu Ala Asp Lys Thr Glu Thr Leu Val Arg Phe Ala 
210 215 220 

Asn Asn Ala Gly Gly Leu Asp Asn He Thr Val Ala Leu Val Ser Met 
225 . 230 - 235 240 



Asn Glu Glu Asp Glu Glu 
245 



<210> 212 
<211> 276 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 212 

Met Thr He Gin Met Lys Asn Thr Gly Lys Arg He Asp Leu He Ala 
1 5 10 15 

Asn Arg Lys Pro Gin Ser Gin Arg Val Leu Tyr Glu Leu Arg Asp Arg 
20 25 30 

Leu Lys Arg Asn Gin Phe He Leu Asn Asp Thr Asn Pro Asp He Val 
35 40 45 
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He Ser He Gly Gly Asp Gly Met Leu Leu Ser Ala Phe His Lys Tyr 
50 55 60 

Glu Asn Gin Leu Asp Lys Val Arg Phe He Gly Leu His Thr Gly His 
65 70 75 80 

Leu Gly Phe Tyr Thr Asp Tyr Arg Asp Phe Glu Leu Asp Lys Leu Val 
85 90 95 

Thr Asn Leu Gin Leu Asp Thr Gly Ala Arg Val Ser Tyr Pro Val Leu 
100 105 no 

Asn Val Lys Val Phe Leu Glu Asn Gly Glu Val Lys He Phe Arg Ala 
115 120 125 

Leu Asn Glu Ala Ser lie Arg Arg Ser Asp Arg. Thr Met Val Ala Asp 
130 135 140 

He Val He Asn Gly Val Pro Phe Glu Arg Phe Arg Gly Asp Gly Leu 
145 150 155 160 

Thr Val Ser Thr Pro Thr Gly Ser Thr Ala Tyr Asn Lys Ser Leu Gly 
165 170 175 

Gly Ala Val Leu His Pro Thr He Glu Ala Leu Gin Leu Thr Glu He 
180 185 190 

Ala Ser Leu Asn Asn Arg Val Tyr Arg Thr Leu Gly Ser Ser He He 
195 200 205 

Val Pro Lys Lys Asp Lys He Giu Leu He Pro Thr Arg Asn Asp Tyr 
210 - 215 220 

His Thr He Ser Val Asp Asn Ser Val Tyr Ser Phe Arg Asn He Glu 
225 230 235 240 

Arg lie Glu Tyr Gin lie Asp His His Lys lie His Phe Val Ala Thr 
245 250 255 

Pro Ser His Thr Ser Phe Trp Asn Arg Val Lys Asp Ala Phe He Gly 
260 265 270 ■ 

Glu Val Asp Glu 
275 



<210> 213 
<211> 540 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 213 

Met Ser Lys Glu He Lys Phe Ser Ser Asp Ala Arg Ser Ala Met Val 
15 10 15 

Arg Gly Val Asp lie Leu Ala Asp Thr Val Lys Val Thr Leu Gly Pro 
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20 



25 



30 



Lys Gly Arg Asn Val Val Leu Glu Lys Ser Phe Gly Ser Pro Leu He 
35 40 45 

Thr Asn Asp Gly Val Thr He Ala Lys Glu He Glu Leu Glu Asp His 
50 55 60 



10 



Phe Glu Asn Met Gly Ala Lys Leu Val Ser Glu Val Ala Ser Lys Thr 
65 70 75 80 



15 



" Asn Asp He Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Thr Gin 
85 90 95 

Ala lie Val Arg Glu Gly He Lys Asn Val Thr Ala Gly Ala Asn Pro 
100 105 HO 



20 



He Gly He Arg Arg Gly He Glu Thr Ala Val Ala Ala Ala Val Glu 

115 120 125 

Ala Leu Lys Asn Asn Ala He Pro Val Ala Asn Lys Glu Ala He Ala 
130 135 140 



25 



Gin Val Ala Ala Val Ser Ser Arg Ser Glu Lys Val Gly Glu Tyr He 
145 150 155 160 



30 



Ser Glu Ala Met Glu Lys Val Gly Lys Asp Gly Val He Thr He Glu 
165 170 175 

Glu Ser Arg Gly Met Glu Thr Glu Leu Glu Val Val Glu Gly Met Gin 
180 185 - 190 



35 



Phe Asp Arg Gly Tyr Leu Ser Gin Tyr Met Val Thr Asp Ser Glu Lys 
195 200 205 

Met Val- Ala Asp Leu Glu Asn Pro Tyr He Leu He Thr Asp Lys Lys 

210 215 220 



'He. Ser Asn He Gin Glu He Leu Pro Leu Leu Glu Ser He Leu Gin 
40 225 230 235 240 



45 



Ser Asn Arg Pro Leu Leu He He Ala Asp Asp Val Asp Gly Glu Ala 
245 250 255 

Leu Pro Thr Leu Val Leu Asn Lys He Arg Gly Thr. Phe Asn Val Val 
260 265' 270 



50 



Ala Val Lys Ala Pro Gly Phe Gly Asp "Arg Arg Lys Ala Met Leu Glu 
275 280 285 

Asp He Ala He Leu Thr Gly Gly Thr Val He Thr Glu Asp Leu Gly 
290 295 300 



55 



Leu Glu Leu Lys Asp Ala Thr He Glu Ala Leu Gly Gin Ala Ala Arg 
305 310 315 * 320 

Val Thr Val Asp Lys Asp Ser Thr Val He Val Glu Gly Ala Gly Asn 
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325 330 335 

Pro Glu Ala He Ser His Arg Val Ala Val He Lys Ser Gin He Glu 
340 345 350 

Thr. Thr Thr Ser Glu Phe Asp Arg Glu Lys Leu Gin Glu Arg Leu Ala 
355 3 6 o 365 

Lys Leu Ser Gly Gly Val Ala Val He Lys Val Gly Ala Ala Thr Glu 
10 370 375 380 

Thr Glu Leu Lys Glu Met Lys Leu Arg He Glu Asp Ala Leu Asn Ala 
385- 390 .3-95 400 

15 Thr Arg Ala Ala Val Glu Glu Gly He Val Ala Gly Gly Gly Thr Ala 

405 410 415 

Leu Ala Asn Val He Pro Ala Val Ala Thr Leu Glu Leu Thr Gly Asp 
2Q 420 425 430 

Glu Ala Thr Gly Arg Asn He Val Leu Arg Ala Leu Glu Glu Pro Val 
435 440 445 

Arg Gin He Ala His Asn Ala Gly Phe Glu Gly Ser lie Val He Asp 
25 450 455 460 

Arg Leu Lys Asn Ala Glu Leu Gly He Gly Phe Asn Ala Ala Thr Gly 
46 $ 470 475 48O 

30 Glu Trp Val Asn Met He Asp Gin Gly He He Asp Pro Val Lys Val 

485 490 . 495 

Ser Arg Ser Ala Leu Gin Asn Ala Ala Ser Val Ala Ser Leu He Leu 
35 • 500 505 510 

Thr Thr Glu Ala Val Val Ala Asn Lys Pro Glu Pro Val Ala Pro Ala 
515 520 525 * 

Pro Ala Met Asp Pro Ser Met Met Gly Gly Met Met 
40 530 535 540 



<210> 214 
<211> 481 
45 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 214 

Met He Lys He Glu Thr Val Leu Asp He Leu Lys Lys Asp Gly Leu 
50 1 5 . 10 15 

Phe Arg Glu He He Asp Gin Gly. His Tyr His Tyr Asn Tyr Ser Lys 
20 25 30 

55 Val He Phe Asp Ser He Ser Tyr Asp Ser Arg Lys Val Thr Glu Asp 
35 40 45 
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Thr Leu Phe Phe Ala Lys Gly Ala Ala Phe Lys Lys Glu Tyr Leu Leu 
50 55 60 

Ser Ala He Thr Gin Gly Leu Ala Trp Tyr Val Ala Glu Lys Asp Tyr 
65 70 75 80 

Glu Val Gly He Pro Val He He Val Asn Asp He Lys Lys Ala Met 
85 90 95 

Ser Leu He Ala Met Glu Phe Tyr Gly Asn Pro Gin Glu Lys Leu Lys 
100 105 no 

Leu Leu Ala Phe Thr Gly Thr Lys Gly Lys Thr Thr Ala Ala Tyr Phe 
115 120 125 

Ala Tyr Asn lie Leu Ser Gin Gly His Arg Pro Ala Met Leu Ser Thr 
130 135 140 

Met Asn Thr Thr Leu Asp Gly Glu Thr Phe Phe Lys Ser Ala Leu Thr 
145 150 155 160 

Thr Pro Glu Ser He Asp Leu Phe Asp Met Met Asn Gin Ala Val Gin 
165 170 175 

.Asn Asp Arg Thr His Leu He Met Glu Val Ser Ser Gin Ala Tyr Leu 
180 185 190 

Val Lys Arg Val' Tyr Gly Leu Thr Phe Asp Val Gly Val Phe Leu Asn 
195 200 205 

lie "Ser Pro Asp His He Gly Pro He Glu His Pro Ser Phe Glu Asp 
210 215 220 

Tyr Phe Tyr His Lys Arg Leu Leu Met Glu Lys Ser Arg Ala Val" He 
225 230 235 240 

lie Asn Ser Asp Met Asp His Phe Ser Val Leu Lys Glu Gin Val Glu 
245 250 255 

Asp Gin Asp His Asp Phe Tyr Gly Ser. Gin Phe Asp Asn Gin lie Glu 
260 265 270 

Asn Ser Lys Ala Phe Ser Phe Ser Ala Thr Gly Lys Leu Ala Gly Asp 
275 280 285 

Tyr Asp -lie Gin Leu He Gly Asn Phe Asn Gin Glu Asn Ala Val Ala 
290 295 300 

Ala Gly Leu Ala Cys Leu Arg Leu Gly Ala Ser Leu Glu Asp He Lys 
305 310 315 320 

Lys Gly, lie Ala Ala Thr Arg Val Pro Gly Arg Met Glu Val Leu Thr 
325 330 335 



Gin Lys Asn Gly Ala Lys Val Phe lie Asp Tyr Ala His Asn Gly Asp 
340 345 350 
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Ser Leu Lys Lys Leu He Asn Val Val Glu Thr His Gin Thr Gly Lys 
355 360 365 

He Ala Leu Val Leu Gly Ser Thr Gly Asn Lys Gly Glu Ser Arq Arq 
5 370 375 380 

Lys Asp Phe Gly Leu Leu Leu Asn Gin His Pro Glu lie Gin Val Phe 
38 5 390 395 400 

10 Leu Thr Ala Asp Asp Pro Asn Tyr Glu Asp Pro Met Ala He Ala Asp 

405 410 415 

Glu He Ser Ser Tyr He Asn His Pro Val Glu Lys He Ala Asp Arg 
' 4 20 425 430 

Gin Glu Ala He Lys Ala Ala Met Ala He Thr' Asn His Glu Leu Asp 
435 440 445 

Ala Val He lie Ala Gly Lys Gly Ala Asp Cys Tyr Gin He He Gin 
20 450 455 460 

Gly Lys Lys Glu Ser Tyr Pro Gly Asp Thr Ala Val Ala Glu Asn Tyr 
4 65 470 475 480 

25 Leu 



<210> 215 
30 <211> 659 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 215 

35 Met He Gin He Gly Lys lie Phe Ala Gly Arg Tyr Arg He Val Lys 
1 5 10 is 

Gin He Gly Arg Gly Gly Met Ala Asp Val Tyr Leu Ala Lys Asp Leu 

40 20 " 

He Leu Asp Gly Glu Glu Val Ala Val Lys Val Leu Arg Thr Asn Tyr 
35 40 45 

Gin Thr Asp Pro He Ala Val Ala Arg Phe Gin Arg Glu Ala Arq Ala 
45 50 55 60 

Met Ala Asp Leu Asp His Pro His lie Val Arg He Thr Asp He Gly 
65 70 75 80 

50 Glu Glu Asp Gly Gin Gin Tyr Leu Ala Met Glu Tyr Val Ala Gly Leu 

85 90 95 

Asp Leu Lys Arg Tyr He Lys Glu His Tyr Pro Leu Ser Asn Glu Glu 
^ 100 105 HO 

Ala Val Arg He Met Gly Gin He Leu Leu -Ala Met Arg Leu Ala His 
115 120 125 
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Thr Arg Gly He Val His Arg Asp Leu Lys Pro Gin Asn He Leu Leu 
130 135 140 

5 Thr Pro Asp Gly Thr Ala Lys Val Thr Asp Phe Gly He Ala Val Ala 

150 155 i 6 o 



Phe Ala Glu Thr Ser Leu Thr Gin Thr Asn Ser Met Leu Gly Ser Val 
165 170 175 

His Tyr Leu Ser Pro Glu Gin Ala Arg Gly. Ser Lys Ala Thr Val Gin 
180 185 190 

Ser Asp He Tyr Ala Met Gly He He Phe Tyr Glu Met Leu Thr Gly 
195 200 205 

His He Pro Tyr Asp Gly Asp Ser Ala Val Thr He Ala Leu Gin His 
210 215 220 

20 Phe Gin Lys Pro Leu Pro Ser Val He Ala Glu Asn Pro Ser Val Pro 
225 230 235 240 



10 



15 



25 



Gin Ala Leu Glu Asn Val He He Lys Ala Thr Ala Lys Lys Leu Thr 
245 250 255 

Asn Arg Tyr Arg Ser Val Ser Glu Met Tyr Val Asp Leu Ser Ser Ser 
260 265 270 



30 



Leu Ser Tyr Asn Arg Arg Asn Glu Ser Lys Leu lie Phe Asp Glu Thr 
275 280 285 



Ser Lys Ala Asp Thr Lys Thr Leu Pro Lys Val Ser Gin Ser Thr Leu 
290 295 300 

35 Thr Ser He Pro Lys Val Gin Ala Gin Thr Glu His Lys Ser He Lys 

305 310 315 320 



40 



Asn Pro Ser Gin Ala Val Thr Glu Glu Thr Tyr Gin Pro Gin Ala Pro 
325 330 335 

Lys Lys His Arg Phe Lys Met Arg Tyr Leu He Leu Leu Ala Ser Leu 
340 345 350 



45 



Val Leu Val Ala Ala Ser Leu He Trp He Leu Ser Arg Thr Pro Ala 
355 . 360 365 



Thr He Ala He Pro Asp Val Ala Gly Gin Thr Val Ala Glu Ala Lys 
370 ' 375 380 

50 Ala Thr Leu Lys Lys Ala. Asn Phe Glu He Gly Glu Glu Lys Thr Glu 
38 5 390 395 400 



55 



Ala Ser Glu Lys Val Glu Glu -Gly Arg lie He Arg Thr Asp Pro Gly 
405 410 415 

Ala Gly Thr Gly Arg Lys Glu Gly Thr Lys He Asn Leu Val Val Ser 
420 425 430 
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Ser Gly Lys Gin Ser Phe Gin He Ser Asn Tyr Val Gly Arg Lys Ser 
435 440 445 

Ser Asp Val He Ala Glu Leu Lys Glu Lys Lys Val Pro Asp Asn Leu 
450 455 460 

He Lys He Glu Glu Glu Glu Ser Asn Glu Ser Glu Ala Gly Thr Val 
465 470 475 480 

Leu Lys Gin Ser Leu Pro Glu Gly Thr Thr Tyr Asp Leu Ser Lys Ala 
485 490 495 

Thr Gin lie Val Leu Thr Val Ala Lys Lys Ala Thr Thr He Gin Leu 
500 505 510 

Gly Asn Tyr lie Gly Arg Asn Ser Thr Glu Val He Ser Glu Leu Lys 
515 520 525 

Gin Lys Lys Val Pro Glu Asn Leu He Lys He Glu Glu Glu Glu Ser 
530 535 540 

Ser Glu Ser Glu Pro Gly Thr He Met Lys Gin Ser Pro Gly Ala Gly 
545 550 555 560 

Thr Thr Tyr Asp Val Ser Lys Pro Thr Gin He Val Leu Thr Val Ala 
565 570 575 

Lys Lys Val Thr Ser Val Ala Met Pro Ser Tyr He Gly Ser Ser Leu 
580 585 590 . * 

Glu Phe Thr Lys Asn Asn Leu He Gin He Val Gly He Lys Glu Ala 
595 600 605 

Asn He Glu Val Val Glu Val Thr Thr Ala Pro Ala Gly Ser Ala Glu 
610 615 620 

Gly Met Val Val Glu Gin Ser Pro Arg Ala Gly Glu Lys Val Asp Leu 
625 630 635 640 

Asn Lys Thr Arg Val Lys He Ser He Tyr Lys Pro Lys Thr Thr Ser 
645 650 655 



Ala Thr Pro 



<210> 216 
<211> 391 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 216 

Met Lys His Phe Asp Thr He Val He Gly Gly Gly Pro Ala Gly Met 
1 5 10 15 

Met Ala Thr He Ser Ser Asn Phe Tyr Gly Gin Lys Thr Leu Leu He 
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20 



25 



30 



Glu Lys Asn Arg Lys Leu Gly Lys Lys Leu Ala Gly Thr Gly Gly Gly 
35 40 45 

5 

Arg Cys Asn Val Thr Asn Asn Gly Ser Leu Asp Asn Leu Leu Ala Gly 
50 55 60 

lie Pro Gly Asn Gly Arg Phe Leu Tyr Ser Val Phe Ser Gin Phe Asp 
10 65 70 75 80 

Asn His Asp lie lie Asn Phe Phe Thr Glu Asn Gly Val Lys Leu Lys 
85 90 95 

15 Val Glu Asp His Gly Arg Val Phe Pro Ala Ser Asp Lys Ser Arg Thr 
100 105 * 110 



20 



He He Glu Ala Leu Glu Lys Lys He Thr Glu Leu Gly Gly Gin Val 
115 120 125 

Ala Thr Gin He Glu He Val Ser Val Lys Lys Val Asp Asp Gin Phe 
130 135 140 



Val Leu Lys Ser Ala Asp Gin Thr Phe Thr Cys Glu Lys Leu He Val 

25 145 150 155 160 

Thr Thr Gly Gly Lys Ser Tyr Pro Ser Thr Gly Ser Thr Gly Phe Gly 

165 170 175 

30 His Glu lie Ala Arg His Phe Lys His Thr He Thr Asp Leu Glu Ala 

180 185 190 



35 



Ala Glu Ser Pro Leu Leu Thr Asp Phe Pro His Lys Ala Leu Gin Gly 
195 200 205 

He Ser Leu Asp Asp Val Thr Leu Ser Tyr Gly Lys His Val He Thr 
210 215 220 



His Asp Leu Leu Phe Thr His Phe Gly Leu Ser Gly Pro Ala Ala Leu 

40 225 230 235 240 

Arg Met Ser Ser Phe Val Lys Gly Gly Glu Val Leu Ser Leu Asp Val 

245- 250 255 

45 Leu Pro Gin Leu Ser Glu Lys Asp Leu Val Thr Phe Leu Glu Glu Asn 

260 265 270 



50 



Arg Glu Lys Ser Leu Lys Asn Ala Leu Lys Thr Leu Leu Pro Glu Arg 
275 280 285 

Leu Ala Glu Phe Phe Val Gin Gly Tyr Pro Glu Lys Val Lys Gin Leu 
290 . 295 300 



Thr Glu Lys Glu Arg Glu Gin Leu Val Gin Ser He Lys Glu Leu Lys 
55 305 310 315 320 



He Pro Val Thr Gly Lys Met Ser Leu Ala Lys Ser Phe Val Thr Lys 
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325 330 



335 



Gly Gly Val Ser Leu Lys Glu He Asn Pro Lys Thr Leu Glu Ser Lys 
5 340 345 350 

Leu Val Pro Gly Leu His Phe Ala Gly Glu Val Met Asp He Asn Ala 
355 360 365 

His Thr Gly Gly Phe Asn He Thr Ser Ala Leu Cys Thr Gly Trp Val 
10 370 375 380 



15 



20 



Ala Gly Ser Leu His Tyr Asp 
385 390 



<210> 217 
<211> 231 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 217 

Met Leu Lys Trp Glu Asp Leu Pro Val Glu Met Lys Ser Ser Glu Val 
1 5 io • ^ 



15 



25 Glu Ser Tyr Tyr Gin Leu Val Ser Lys Arg Lys Gly Ser Leu He Phe 
20 25 30 

Lys Arg Cys Leu Asp Trp Val Leu Ala Leu Val Leu Thr Trp Val Leu 

30 

Thr Ser Pro lie Phe Leu lie Leu Ser He Trp He Lys Leu Asp Ser 
50 55 60 

Lys Gly Pro Val He Tyr Lys Gin Glu Arg Val Thr Gin Tyr Asn Arq 
35 65 70 75 80 

Arg Phe Lys He Trp Lys Phe Arg Thr Met Val Thr Asp Ala Asp Lys 
85 90 95 

40 Lys Gly Ser Leu Val Thr Ser Ala Asn Asp Ser Arg lie .Thr Lys Val 
100 ' 105 no 

Gly Asn Phe He Arg Arg Val Arg Leu Asp Glu Leu Pro Gin Leu Val 
■ 115 120 125 

45 

Asn Val Leu Lys Gly Glu Met Ser Phe Val Gly Thr Arg Pro Glu Val 
130 135 140 

Pro Arg Tyr Thr Glu Gin Tyr Ser Pro Glu Met Met Ala Thr Leu Leu 
50 14 5 150 155 160 

Leu Gin Ala 'Gly He Thr Ser Pro Ala Ser He Asn Tyr Lys Asp Glu 
165 170 175 

55 Asp Thr He lie Ser Gin Met Thr Glu Lys Gly Leu Ser Val Asp Gin 
180 185 190 
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10 



15 



25 



40 



Ala Tyr Val Glu His Val Leu Pro Glu Lys Met Arg Tyr Asn Leu Ala 
195 200 205 

Tyr Leu Arg Glu Phe Ser Phe Phe Gly Asp He Lys He Met Phe Gin 
210 215 • 220 

Thr Val Phe Glu Val Leu Lys 
225 230 



<210> 218 
<211> 140 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 218 

Met Thr Ser Pro Leu Leu Glu Ser Arg Arg Gin Leu Arg Lys Cys Ala 
1 5 10 15 

20 Phe Gin Ala Leu Met Ser Leu Glu Phe Gly Thr Asp Val Glu Thr Ala 
20 25 30 



Cys' Arg Phe Ala Tyr Thr His Asp Arg Glu Tyr Thr Asp Val Gin Leu 
35 40 45 

Pro Ala Phe Leu He Asp Leu Val Ser Gly Val Gin Ala Lys Lys Glu 
50 55 60 



Glu Leu Asp Lys Gin He Thr Gin His Leu Lys Ala Gly Trp Thr He 
30 65 70 75 80 

Glu Arg Leu Thr Leu Val Glu Arg Asn Leu Leu Arg Leu Gly Val Phe 
.85 90 95 

35 Glu lie Thr Ser Phe Asp Thr Pro Gin Leu Val Ala Val Asn Glu Ala 
100 105 no 



lie Glu Leu Ala Lys Asp Phe Ser Asp Gin Lys Ser Ma Arg Phe lie 
115 120 125 

Asn Gly Leu Leu Ser Gin Phe Val Thr Glu Glu Gin 
130 135 140 



45 <210> 219 
<211> 1179 
<212> PRT 

<213> Streptococcus pneumoniae 
50 <400> 219 

Met Tyr Leu Lys Glu lie Glu lie Gin Gly Phe Lys Ser Phe Ala Asp 
1 5 io is 

Lys Thr Lys Val Val Phe Asp Gin Gly Val Thr Ala Val Val Gly Pro 
55 20 25 30 

Asn Gly Ser Gly Lys Ser Asn lie Thr Glu Ser Leu Arg Trp Ala Leu 
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35 



40 



45 



Gly Glu Ser Ser Val Lys Ser Leu Arg Gly Gly Lys Met Pro Asp Val 
50 55 60 

He Phe Ala Gly Thr Glu Ser Arg Lys Pro Leu Asn Tyr Ala Ser Val 
65 70 75 80 

Val Val Thr LeU ASp Asn His Asp Gly Phe Ile Lys As P Ala G1 Y Gin 
10 85 90 95 

Glu Ile Arg Val Glu Arg His Ile Tyr Arg Ser Gly Asp Ser Glu Tyr 
100 105 no 

15 Lys Ile Asp Gly Lys Lys Val Arg Leu Arg Asp Ile His Asp Leu Phe 
115 120 125 



20 



Leu Asp Thr Gly Leu Gly Afg Asp Ser Phe Ser Ile lie Ser Gin Gly 
130 135 140 

Lys Val Glu Glu Ile Phe Asn Ser Lys Pro Glu Glu Arg Arg Ala Ile 

150 155 160 



Phe Glu Glu Ala Ala Gly Val Leu Lys Tyr Lys Thr Arg Arg Lys Glu 
25 165 170 175 

Thr Glu Ser Lys Leu Gin Gin Thr Gin Asp Asn Leu Asp Arg Leu Glu 
180 185 190 

30 Asp Ile Ile Tyr Glu Leu Asp Asn Gin Ile Lys Pro Leu Glu Lys Gin 
195 . 200 205 



35 



Ala Glu Asn Ala Arg Lys Phe Leu Asp Leu Glu Gly Gin Arg Lys Ala 
■ 210 215 220 

Ile Tyr Leu Asp Val Leu Val Ala Gin Ile Lys Glu Asn Lys Ala Glu 

225 230 235 240 



Leu Glu Ser Thr Glu Glu Glu Leu Ala Gin Val Gin Glu Leu Leu Met 
40 245 250 255 

Ser Tyr Tyr Gin Lys Arg Glu Lys Leu Glu Glu Glu Asn Gin Thr Leu 
260 265 27-0 

45 Lys Lys Gin Arg Gin Asp Leu Gin Ala Glu Met Ala Lys Asp Gin Gly 
275 280 285 



50 



Ser Leu Met Asp Leu Thr Ser Leu He "Ser Asp Leu Glu Arg Lys Leu 

290 295 300 

Ala Leu Ser Lys Leu Glu Ser Glu Gin Val Ala Leu Asn Gin Gin Glu 

305 310 315 320 



Ala Gin Ala Arg Leu Ala Ala Leu Glu Asp Lys Arg Asn Ser Leu Ser 
55 325 330 335 

Lys Glu Lys Tyr Asp Lys Glu Ser Ser Leu Ala Leu Leu Glu Gly Asn 
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340 



345 



350 



Leu Val Gin Asn Asn Gin Lys Leu Asn Arg Leu Glu Ala Glu Leu Leu 
355 360 365 

Ala Phe Ser Asp Asp Pro Asp Gin Met He Glu Leu Leu Arg Glu Arg 
370 375 380 

Phe Val Ala Leu Leu Gin Glu Glu Ala Asp Val Ser Asn Gin Leu Thr 
385 390 395 400 

Arg He Glu Asn "Glu Leu Glu Asn Ser Arg Gin Leu Ser Gin Lys Gin 
405 410 415 

Ala Asp Gin Leu Glu Lys Leu Lys Glu Gin Leu Ala Thr Ala Lys Glu 
420 425 430 

■ Lys Ala Ser Gin Gin Lys Asp Glu Leu Glu Thr Ala Lys Val Gin Val 
435 440 445 

Gin Lys Leu Leu Ala Asp Tyr. Gin Ala He Ala Lys Glu Gin Glu Glu 
450 455 460 

Gin Lys Thr Ser Tyr Gin Ala Gin Gin Ser Gin Leu Phe Asp Arg Leu 
465 470 475 480 

Asp Ser Leu Lys Asn Lys Gin Ala Arg Ala Gin Ser Leu Glu Asn He 
485 490 495 

Leu Arg Asn His Ser Asn" Phe Tyr Ala Gly Val Lys Ser Val Leu Gin 
500 505 510 

Glu Lys Asp Arg Leu Gly Gly He He Gly Ala Val Ser Glu His Leii 
515 520 525 

Thr Phe Asp Val Tyr Tyr Gin Thr Ala Leu Glu He Ala Leu Gly Ala 
530 535 540 

Ser Ser Gin His He He Val Glu Asp Glu Glu Ser Ala Thr Lys Ala 
545 550 555 560 

He Asp Phe Leu Lys Arg Asn Arg Val Gly Arg Ala Thr Phe Leu Pro 
565 570 575 

Leu Thr Thr He Lys Ala Arg Thr He Ser Ser Gin Asn Gin Asp Ala 
580 585 . 590 

lie Ala Val Ser Pro Gly Phe Leu Gly Met Ala Asp Glu Leu Val Thr 
595 600 605 

Phe Asp Thr Arg Leu Glu Ala He Phe Lys Asn Leu Leu Ala Thr Thr 
610 615 620 

Ala He Phe Asp Thr Val Glu His Ala Arg Glu Ala Ala Arg Gin Val 
625 630 635 640 



Arg Tyr Gin Val Arg Met Val, Thr Leu Asp Gly Thr Glu Leu Arg Thr 
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645 



650 



655 



10 



15 



20 



Gly Gly Ser Tyr Ala Gly Gly Ala Asn Arg Gin Asn Asn Ser He Phe 
660 665 670 

He Lys Pro Glu Leu Glu Gin Leu Gin Lys Glu He Ala Ala Asp Glu 
675 680 685 

Ala Ser Leu Gly Ser Glu Glu Ala Ala Leu Lys Thr Leu Gin Asp Gin 
690 695 700 

Met Ala Ala Leu Thr Glu Arg Leu Glu Ala He Lys Ser Gin Gly Glu 
705 710 715 720 

Gin Ala Arg He Gin Glu Gin Gly Leu Ser Leu Ala Tyr Gin Gin Thr 
725 730 735 

Ser Gin Gin Val Glu Glu Leu Glu Thr Leu Trp Lys Leu Gin Glu Glu 
740 745 750 

Glu He Asp Arg Leu Ser Glu Gly Asp Trp Gin Ala Asp Lys Glu Lys 
755 760 765 



Cys Gin Glu Ser Leu Ala Thr He Ala Ser Asp Lys Gin Asn Leu Glu 
25 770 775 780 



30 



35 



Ala Glu He Glu Glu He Lys Ser Asn Lys Asn Ala lie Gin Glu Arg 
785 790 795 800 

Tyr Gin Asn Leu Gin Glu Glu Val Ala Gin Ala Arg Leu Leu Lys Thr 
805 810 . 815 

Lys Leu Gin Gly Gin Lys Arg Tyr Glu Val Ala Asp He Glu Arg Leu 
820 825 830 

Gly Lys Glu Leu Asp Asn Leu Asn He Glu Gin Glu Glu He Gin Arg 
835 840 845 



Met Leu Gin Glu Lys Val Asp Asn Leu Glu Lys Val Asp Thr Glu Leu 
40 850 855 860 



45 



Leu Ser Gin Gin Ala Glu Glu Ser Lys Thr Gin Lys Thr Asn Leu Gin 
865 870 875 880 

Gin Gly Leu He Arg Lys Gin Phe Glu Leu Asp Asp He Glu Gly Gin 
88'5 890 895 



50 



Leu Asp Asp He Ala Ser His Leu Asp" Gin Ala Arg Gin Gin Asn Glu 
900 - 905 910 

Glu Trp He Arg Lys Gin Thr Arg Ala Glu Ala Lys Lys Glu Lys Val 
915 920 925 



Ser Glu Arg Leu Arg His Leu Gin Asn Gin Leu Thr Asp Gin Tyr Gin 
55 930 935 940 



He Ser Tyr Thr Glu Ala Leu Glu Lys Ala His Glu Leu Glu Asn Leu 
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945 950 955 



960 



Asn Leu Ala Glu Gin Glu Val Gin Asp Leu Glu Lys Ala He Arg Ser 
965 970 975 

Leu Gly Pro Val Asn Leu Glu Ala He Asp Gin Tyr Glu Glu Val His 
980 985 990 

Asn Arg Leu Asp Phe Leu Asn Ser Gin Arg Asp Asp He Leu Ser Ala 
995 1000 1005 

Lys Asn Leu Leu Leu Glu Thr He Thr Glu Met Asn Asp Glu Val Lys 
1010 1015 1020 

Glu Arg Phe Lys Ser Thr Phe Glu Ala He Arg Glu Ser Phe Lys Val 
1025 1030 1035 1040 

Thr Phe Lys Gin Met Phe Gly Gly Gly Gin Ala Asp Leu He Leu Thr 
1045 1050 1055 

Glu Gly Asp Leu Leu Thr Ala Gly Val Glu He Ser Val Gin Pro Pro 
1060 1065 1070 

Gly Lys Lys He Gin Ser Leu Asn Leu Met Ser Gly Gly Glu Lys Ala 
1075 1080 1085 

Leu Ser Ala Leu Ala Leu Leu Phe Ser He He Arg Val Lys Thr He 
1090 1095 HOO 

Pro Phe Val He Leu Asp Glu Val Glu Ala Ala Leu Asp Glu Ala Asn 
1105 mo 1H5 H20 

Val Lys Arg Phe Gly Asp Tyr Leu Asn Arg Phe Asp Lys Asp Ser Gin 
1125 1130 . H35 

Phe He Val Val Thr His Arg Lys Gly Thr Met Ala Ala Ala Asp Ser 
1140 1145 1150 

He Tyr Gly Val Thr Met Gin Glu Ser Gly Val Ser Lys He Val Ser 
1155 1160 1165 

Val Lys Leu Lys Asp Leu Glu Ser He Glu Gly 
1170 1175 



<210> 220 
<211> 447 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 220 

Met Thr Lys Arg Val Thr He lie Asp Val Lys Asp Tyr Val Gly Gin 
15 10 15 

Glu Val Thr He Gly Ala Trp Val Ala Asn Lys Ser Gly Lys Gly Lys 
20 25 30 
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He Ala Phe Leu Gin Leu Arg Asp Gly Thr Ala Phe Phe Gin Gly Val 
35 40 45 

Ala Phe Lys Pro Asn Phe Val Glu Lys Phe Gly Glu Glu Val Gly Leu 
50 55 60 

Glu Lys Phe Asp Val lie Lys Arg Leu Ser .Gin Glu Thr Ser Val Tyr 
65 70 75 80 

Val Thr Gly He Val Lys Glu Asp Glu Arg Ser Lys Phe Gly Tyr Glu 
85 90 95 

Leu Asp He Thr Asp He Glu Val He Gly Glu Ser Gin Asp Tyr Pro 
100 105 HO 

He Thr Pro Lys Glu His Gly Thr Asp Phe Leu Met Asp Asn Arg His 
115 120 125 

Leu Trp Leu Arg Ser Arg Lys Gin Val Ala Val Leu Gin He Arg Asn 
130 135 140 

Ala He He Tyr Ala Thr Tyr Glu Phe Phe Asp Lys Asn Gly Phe Met 
145 150 155 160 

Lys Phe Asp Ser Pro lie Leu Ser Gly Asn Ala Ala Glu Asp Ser Thr 
165 170 175 

Glu Leu Phe Glu Thr Asp Tyr Phe Gly Thr Pro Ala Tyr Leu Ser Gin 
180 185 190 

Ser Gly Gin Leu Tyr Leu Glu Ala Gly Ala Met Ala Leu Gly Arg Val 
195 200 205 

Phe Asp Phe Gly Pro Val Phe Arg Ala Glu Lys Ser Lys Thr Arg Arg 
210 215 220 

His Leu Thr' Glu Phe Trp Met Met Asp Ala Glu Tyr Ser Tyr Leu Thr 
225 230 235 240 

His Asp Glu Ser Leu Asp Leu Gin Glu Ala Tyr Val Lys Ala Leu Leu 
245 250 255 

Gin Gly Val Leu Asp Arg Ala Pro Gin Ala Leu Glu Thr Leu Glu Arg 
260 265 270 

Asp Thr Glu Leu Leu Lys Arg Tyr He Ala Glu Pro Phe Lys Arg lie 
275 280 . 285 

Thr Tyr Asp Gin Ala He Asp Leu Leu Gin Glu His Glu Asn Asp Glu 
290 295 300 

Asp Ala Asp Tyr Glu His Leu Glu His Gly Asp Asp Phe Gly Ser Pro 
305 310 315 320 



His Glu Thr Trp lie Ser Asn His Phe Gly Val Pro Thr Phe Val Met 
325 330 335 
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Asn Tyr Pro Ala Ala lie Lys Ala Phe Tyr Met Lys Pro Val Pro Gly 
340 345 350 

Asn Pro Glu Arg Val Leu Cys Ala Asp Leu Leu Ala Pro Glu Gly Tyr 
5 355 360 365 

Gly Glu He He Gly Gly Ser Met Arg Glu Glu Asp Tyr Asp Ala Leu 
370 375 380 

10 Val Ala Lys Met Asp Glu Leu Gly Met Asp Arg Thr Glu Tyr Glu Phe 
385 390 395 400 

Tyr Leu Asp Leu Arg Lys Tyr Gly Thr Val Pro His Gly Gly Phe Gly 
405 410 415 

15 

lie Gly He Glu Arg Met Val Thr Phe Ala Ala Gly Thr Lys His He 
420 425 430 

Arg Glu Ala He Pro Phe Pro Arg Met Leu His Arg He Lys Pro 
10 435 440 445 



<210> 221 
<211> 308 
5 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 221 

Met Ser Glu Lys . Leu Val Glu He Lys Asp Leu Glu He Ser Phe Gly 
3 1 5 10 15 

Glu Gly Ser Lys Lys Phe Val Ala Val Lys Asn Ala Asn Phe Phe He 
20 25 30 

> Asn Lys Gly Glu Thr Phe Ser Leu Val Gly Glu Ser Gly Ser Gly Lys 
35 40 45 

Thr Thr He Gly Arg Ala He He Gly Leu Asn Asp Thr Ser Asn Gly 
50 55 60 

> 

Asp lie lie Phe Asp Gly Gin Lys lie Asn Gly Lys Lys Ser Arg Glu 
65 70 75 80 

Gin Ala Ala Glu Leu lie Arg Arg lie Gin Met lie Phe Gin Asp Pro 
' 85 90 95 

Ala Ala Ser Leu Asn Glu Arg Ala Thr Val Asp Tyr lie lie Ser Glu 
100 105'" 110 

Gly Leu Tyr Asn His Arg Leu Phe Lys Asp Glu Glu Glu Arg Lys Glu 
115 120 125 

Lys Val Gin Asn He He Arg Glu Val Gly Leu Leu Ala Glu His Leu 
130 135 140 

Thr Arg Tyr Pro His Glu Phe. Ser Gly Gly Gin Arg Gin Arg He Gly 
145 150 155 160 
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He Ala Arg Ala Leu Val Met Gin Pro Asp Phe Val He Ala Asp Glu 
165 170 175 

5 Pro He Ser Ala Leu -Asp Val Ser Val Arg Ala Gin Val Leu Asn Leu 
180 185 190 

Leu Lys Lys Phe Gin Lys Glu Leu Gly Leu Thr Tyr Leu Phe He Ala 
j 195 200 205 

His Asp Leu Ser Val Val Arg Phe Tie Ser Asp Arg lie Ala Val lie 
210 215 220 

Tyr Lys Gly Val He Val Glu Val Ala Glu Thr Glu Glu Leu Phe Asn 
15 225 230 235 240 

Asn Pro He His Pro Tyr Thr Gin Ala Leu Leu Ser Ala Val Pro He 
245 250 255 

20 Pro Asp Pro He Leu Glu Arg Lys Lys Val Leu Lys Val Tyr Asp Pro 
260 265 270 

Ser Gin His Asp Tyr Glu Thr Asp Lys Pro Ser Met Val Glu He Arg 
25 275 280 285 

Pro Gly His Tyr Val Trp Ala Asn Gin Thr Glu Leu Ala Arg Tyr Gin 
290 295 300 

Lys Gly Leu Asn 
30 305 



<210> 222 
<211> 424 
35 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 222 

Met Lys lie Ser Trp Asn Gly Phe Ser Lys Lys Ser Tyr Gin Glu Arq 
40 1 5 10 15 

Leu Glu Leu Leu Lys Ala Gin Ala Leu Leu Ser Pro Glu Arg Gin Ala 
20 25 30 

45 Ser Leu Glu Lys Asp Glu Gin Met Ser Val Thr Val Ala Asp Gin Leu 
35 40 . 45 

Ser Glu Asn Val Val. Gly Thr Phe Ser "Leu Pro Tyr Ser Leu Val Pro 
50 55 60 

50 

Glu Val Leu Val Asn Gly Gin Glu Tyr Thr Val Pro Tyr Val Thr Glu 
■ 65 70 75 80 

Glu Pro Ser Val Val Ala Ala Ala Ser Tyr Ala Ser Lys He He Lys 
55 85 90 95 

Arg Ala Gly Gly Phe Thr Ala Gin Val His Gin Arg Gin Met He Gly 
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100 



105 



110 



Gin Val Ala Leu Tyr Gin lie Ala Asn Pro Lys Leu Ala Gin Glu Lys 
115 120 125 

He Ala Ser Lys Lys Ala Glu Leu Leu Glu Leu Ala Asn Gin Ala Tyr 
130 135 140 

Pro Ser He Val Lys Arg Gly Gly Gly Ala Arg Asp Leu His Val Glu 
10 145 150 155 160 

Gin He Lys Gly Glu Pro Asp Phe Leu Val Val Tyr He His Val Asp 
165 170 175 

15 Thr Gin Glu Ala Met Gly Ala Asn Met Leu Asn Thr Met Leu Glu Ala 
180 185 190 



20 



35 



Leu Lys Pro Val Leu Glu Glu Leu Ser Gin Gly Gin Ser* Leu Met Gly 
195 200 205 

He Leu Ser Asn Tyr Ala Thr Asp Ser Leu Val Thr Ala Ser Cys Arq 
210 215 220 



He Ala Phe Arg Tyr Leu Ser Arg Gin Lys Asp Gin Gly Arg Glu He 
25 225 230 235 240 

Ala Glu Lys He Ala Leu Ala Ser Gin Phe Ala Gin Ala Asp Pro Tyr 
2^5 250 255 

30 Arg Ala Ala Thr His Asn Lys Gly He Phe Asn Gly He Asp Ala He 
260 265 270 



Leu He Ala Thr Gly Asn Asp Trp Arg Ala He Glu Ala Gly Ala His 
275 280 285 

Ala Phe Ala Ser Arg Asp Gly Arg Tyr Gin Gly Leu Ser Cys Trp Thr 
290 295 300 



Leu Asp Leu Glu Arg Glu Glu Leu Val Gly Glu Met Thr Leu Pro Met 
40 305 310 315 320 

Pro Val Ala Thr Lys Gly Gly Ser He Gly Leu Asn Pro Arg Val Ala 
325 ' 330 335 

45 Leu Ser His Asp Leu Leu Gly Asn Pro Ser Ala Arg Glu Leu Ala Gin 
340 345 350 



50 



lie lie Val Ser He Gly Leu Ala Gin Asn Phe Ala Ala Leu Lys Ala 
355 360 365 

Leu Val Ser Thr Gly He Gin Gin Gly His Met Lys Leu Gin Ala Lys 
370 375 380 



Ser Leu Ala Leu Leu Ala Gly Ala Ser Glu Ser Glu Val Ala Pro Leu 
55 38 5 390 395 400 

Val Glu Arg Leu He Ser Asp Lys Thr Phe Asn Leu Glu Thr Ala Gin 
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405 410 415 



Arg Tyr Leu Glu Asn Leu Arg Ser 
420. 



<210> 223 
<211> 262 
<212> PRT 
10 <213> Streptococcus pneumonic 

<400> 223 



15 



30 



Met Pro lie Thr Ser Leu Glu lie Lys Asp Lys Thr Phe Gly Thr Ara 
1 5 10 15 

Phe Arg Gly Phe Asp Pro Glu Glu Val Asp Glu Phe Leu Asp He Val 
20 -25 30 



Val Arg Asp Tyr Glu Asp Leu Val Arg Ala Asn His Asp Lys Asn Leu 
20 35 40 45 

Arg He Lys Ser Leu Glu Glu Arg Leu Ser Tyr Phe Asp Glu lie Lys 
50 55 60 

25 Asp Ser Leu Ser Gin Ser Val Leu He Ala Gin Asp Thr Ala Glu Arq 
65 70 75 80 



Val Lys Gin Ala Ala His Glu Arg Ser Asn Asn He He His Gin Ala 
85 90 95 

Glu Gin Asp Ala Gin Arg Leu Leu Glu Glu Ala Lys Tyr Lys Ala Asn 
100 105 no 



Glu He Leu Arg Gin Ala Thr Asp Asn Ala Lys Lys Val Ala Val Glu 
35 H5 120 i 2 5 

Thr Glu Glu Leu Lys Asn Lys Ser Arg Val Phe His Gin Arq Leu Lys 
130 135 140 / 

40 Ser Thr He Glu Ser Gin Leu Ala He Val Glu Ser Ser Asp Trp Glu 
145 150 155 ; 160 

Asp He Leu Arg Pro Thr Ala Thr Tyr Leu Gin Thr Ser Asp Glu Ala 
45 165 170 175 

Phe Lys Glu Val Val Ser Glu Val Leu Gly Glu Pro He Pro Ala Pro 
180 185 190 

lie Glu Glu Glu Pro He Asp Met Thr Arg Gin Phe Ser Gin Ala Glu 
50 195 200 205 

Met Ala Glu Leu Gin Ala Arg He Glu Val Ala Asp Lys Glu Leu Ser 
210 215 220 

55 Glu Phe Glu Ala Gin He Lys Gin Glu Val Glu Ala Pro Thr Pro Val 
22 5 230 235 240 
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Val Ser Pro Gin Val Glu Glu Glu Pro Leu Leu He Gin Leu Ala Gin 
245 250 255 

Cys Met Lys Asn Gin Lys 
5 260 



<210> 224 
<211> 575 
10 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 224 

Met Ser Asn Gly Gin Leu He Tyr Leu Met Val Ala He Ala Val He 
15 1 5 10 15 

Leu Val Leu Ala Tyr Val Val Ala He Phe Leu Arg Lys Arg Asn Glu 
20 25 30 

20 Gly Arg Leu Glu Ala Leu Glu Glu Arg Lys" Glu Glu Leu Tyr Asn Leu 
35 40 45 

Pro Val Asn Asp Glu Val Glu Ala Val Lys Asn Met His Leu He Gly 
50 55 60 

25 

Gin Ser Gin Val Ala Phe Arg Glu Trp Asn Gin Lys Trp Val Asp Leu 
65 70 75 80 

Ser Leu Asn Ser Phe Ala Asp He Glu Asn Asn Leu Phe Glu Ala Glu 
30 85 90 95 

Gly Tyr Asn His Ser Phe Arg Phe Leu Lys Ala Ser His Gin He Asp 
100 105 HO 

35 Gin He Glu Ser Gin He Thr Leu He Glu Glu Asp He Ala Ala He 
115 120 125 

Arg Asn Ala Leu Ala Asp Leu Glu Lys Gin Glu Ser Lys Asn Ser Gly 
130 135 140 

40 

Arg Val Leu His Ala Leu Asp Leu Phe Glu Glu Leu Gin His Arg Val 
145 150 155 160 

Ala Glu Asn Ser Glu Gin Tyr Gly Gin Ala Leu Asp Glu He Glu Lys 
45 165 170 175 

Gin Leu Glu Asn He Gin Ser Glu Phe Ser Gin Phe Val Thr Leu Asn 
180 185" 190 

50 Ser Ser Gly Asp Pro Val Glu Ala Ala Val He Leu Asp Asn Thr Glu 
195 200 205 

Asn His He Leu Ala Leu Ser His lie Val Asp Arg Val Pro Ala Leu 
210 215 220 

55 

Val Thr Thr Leu. Ser Thr Glu Leii Pro Asp Gin Leu Gin Asp Leu Glu 
225 230 235 240 
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Ala Gly Tyr Arg Lys Leu lie Asp Ala Asn Tyr His Phe Val Glu Thr 
24 5 250 255 

Asp lie Glu Ala Arg Phe His Leu Leu Tyr Glu Ala Phe Lys Lys Asn 
260 . 265 270 

Gin Glu Asn lie Arg Gin Leu Glu Leu Asp Asn Ala Glu Tyr Glu Asn 
275 280 285 

Gly Gin Ala Gin Glu Glu He Asn Ala Leu Tyr Asp He Phe Thr Arg 
290 295 300 

Glu He Ala Ala Gin Lys Val Val Glu Asn Leu Leu Ala Thr Leu Pro 
305 310 315 320 

Thr Tyr Leu Gin His Met Lys Glu Asn Asn Thr Leu Leu Gly Glu Asp 
325 • 330 335 

He Ala Arg Leu Asn Lys Thr Tyr Leu Leu Pro Glu Thr Ala Ala Ser 
340 ' 345 350 

His Val Arg Arg lie Gin Thr Glu Leu Glu Ser Phe Glu Ala Ala lie 
355 360 365 

Val Glu Val Thr Ser Asn Gin Glu Glu Pro Thr Gin Ala Tyr Ser Val 
370 375 380 

Leu Glu Glu Asn Leu Glu Asp Leu Gin Thr Gin Leu Lys Asp He Glu 
385 390 395 400 

Asp Glu Gin lie Ser Val Ser Glu Arg Leu Thr Gin He Glu Lys Asp 
405 410 415 

Asp He Asn Ala Arg Gin Lys Ala Asn Val Tyr Val Asn Arg Leu His 
420 425 430 

Thr lie Lys Arg Tyr Met Glu Lys Arg Asn Leu Pro Gly He Pro Gin 
435 440 445 

Thr Phe Leu Lys Leu Phe Phe Thr Ala Ser Asn Asn Thr Glu Asp Leu 
450 455 460 

Met Val Glu Leu Glu Gin Lys Met He Asn He Glu Ser Val Thr Arg 
465 470 475. 480 

Val Leu Glu lie Ala Thr Asn Asp Met. Glu Ala Leu Glu Thr Glu Thr 
485 490 495 

Tyr Asn lie Val Gin Tyr Ala Thr Leu Thr Glu Gin Leu Leu Gin Tyr 
500 505 ' 510 

Ser Asn Arg Tyr Arg Ser Phe Asp Glu. Arg lie Gin Glu Ala Phe Asn 
515 520 525 



Glu Ala Leu Asp He Phe Glu Lys Glu Phe Asp Tyr His Ala Ser Phe 
530 535 540 
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Asp Lys He Ser Gin Ala Leu Glu Val Ala Glu Pro Gly Val Thr Asn 
545 550 555 560 

Arg Phe Val Thr Ser Tyr Glu Lys Thr Arg Glu Thr He Arg Phe 

565 570 575 



<210> 225 
<211> 800 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 225 

Met Leu He Ser Tyr Lys Trp Leu Lys Glu Leu Val Asp He Asp Val 
1 5 10 15 

Pro Ser Gin Glu Leu Ala Glu Lys Met Ser Thr Thr Gly He Glu Val 
20 25 30 

Glu Gly Val Glu Ser Pro Ala Ala Gly Leu Ser Lys He Val Val Gly 
35 40 45 

Glu Val Leu Ser Cys Glu Asp Val Pro Glu Thr His Leu His Val Cys 
50 55 60 

Gin Val Asn Val Gly Glu Glu Glu Arg Gin He Val Cys Gly Ala Pro 
65 70 75 80 

Asn Val Arg Ala Gly He Lys Val Met Val Ala Leu Pro Gly Ala Arg 
85 90 95 

He Ala Asp Asn Tyr Lys He Lys Lys Gly Lys lie Afg Gly Leu Glu 
100 105 HO 

Ser Leu Gly Met He Cys Ser Leu Gly Glu Leu Gly He Ser Asp Ser 
115 120 125 

Val Val Pro Lys Glu Phe Ala Asp Gly He Gin He Leu Pro Glu Asp 
130 135 140 

Ala Val Pro Gly Glu Glu Val Phe Ser Tyr Leu Asp Leu Asp Asp Glu 
145 150 155 160 

He He Glu Leu Ser He Thr Pro Asn Arg Ala Asp Ala Leu Ser Met 
165 170 175 

Cys Gly Val Ala His Glu Val Ala Ala He Tyr Asp Lys Ala Val Asn 
180 185 190 

Phe Lys Glu Phe Thr Leu Thr Glu Thr Asn Glu Ala Ala Ala Asp Ala 
195 200 205 

Leu Ser Val Ser He Glu Thr Asp Lys Ala Pro Tyr Tyr Ala Ala Arg 
210 215 220 

He Leu Asp Asn Val Thr He Ala Pro Ser Pro Gin Trp Leu Gin Asn 
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225 



230 



235 



240 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Leu Leu Met Asn Glu Gly lie Arg Pro lie Asn Asn Val Val Asp Val 
245 250 255 

Thr Asn Tyr lie Leu Leu Tyr Phe Gly Gin Pro Met His Ala Phe Asp 
260 265 270 

Leu Asp Asn Phe Glu Gly Thr Asp lie Arg Val Arg Glu Ala Arg Ala 
275 280 285 

Gly Glu Lys Leu Val Thr Leu Asp Gly Glu Glu Arg Asp Leu Asp Val 
290 295 300 

Asn Asp Leu Val lie Thr Val Ala Asp Lys Pro Val Ala Leu Ala Gly 
305 310 315 320 

Val Met Gly Gly Gin Ala Thr Glu lie Ser Glu Lys Ser Ser Arg Val 
325 330 335 

Val Leu Glu Ala Ala Val Phe Asn Gly Lys Ser lie Arg Lys Thr Ser 
340 345 350 

Gly Arg Leu Asn Leu Arg Ser Glu Ser Ser Ser Arg Phe Glu Lys Gly 
355 360 365 

He Asn Val Ala Thr Val Asn Glu Ala Leu Asp Ala Ala Ala Ser Leu 
370 375 380 

He Ala Glu Leu Ala Gly Ala Thr Val Arg Lys Gly lie Val Ser Ala 
385 390 395 400 

Gly Glu Leu Asp Thr Ser Asp Val Glu Val Ser Ser Thr Leu Ala Asp 
405 410 415 

Val Asn Arg Val Leu Gly Thr Glu Leu Ser Tyr Ala Asp Val Glu Asp 
420 425 430 

Val Phe Arg Arg Leu Gly Phe Gly Leu Ser Gly Asn Ala Asp Ser Phe 
435 440 445 

Thr Val Arg Val Pro Arg Arg Arg Trp Asp He Thr He Glu Ala Asp 
450 455 460 

Leu Phe Glu Glu He Ala Arg He Tyr Gly Tyr Asp Arg Leu Pro Thr 
465 " 470 475 480 

Ser Leu Pro Lys Asp Asp Gly Thr Ala" Gly Glu Leu Thr Ala Thr Gin 
485 490 495 

Lys Leu Arg Arg Gin Val Arg Thr He Ala Glu Gly Ala Gly Leu Thr 
500 505 510 

Glu He He Thr Tyr Thr Leu Thr Thr Pro Glu Lys Ala Val Glu Phe 
515 520 525 

Thr Ala Gin Pro Ser Asn Leu Thr Glu Leu Met Trp Pro Met Thr Val 
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530 535 * 540 

Asp Arg Ser Val Leu Arg Gin Asn Met lie Ser Gly lie Leu Asp Thr 
545 550 555 5 6 o 

Val Ala Tyr Asn Val Ala Arg Lys Asn Lys Asn Leu Ala Leu Tyr Glu 
565 570 575 

He Gly Lys Val Phe Glu Gin Thr Gly Asn Pro Lys Glu Glu Leu Pro 
!0 580 585 590 

Asn Glu He Asn Ser Phe Ala Phe Ala Leu Thr Gly Leu Val Ala Glu 
595 600 605 

15 Lys Asp Phe Gin Thr Ala Ala Val Pro Val Asp Phe Phe Tyr Ala Lys 
610 615 620 

Gly He Leu Glu Ala Leu Phe Thr Arg Leu Gly Leu Gin Val Thr Tyr 
2Q 6 ? 5 630 635 640 

Thr Ala Thr Ser Glu He Ala Ser Leu His Pro Gly Arg Thr Ala Val 
645 650 655 

He Ser Leu Gly Asp Gin Val Leu Gly Phe Leu Gly Gin Val His Pro 
25 660 665 670 

Val Thr Ala Lys Ala Tyr Asp He Pro Glu Thr Tyr Val Ala Glu Leu 
675 680 685 

30 Asn Leu Ser Ala He Glu Ala Ala Leu Gin Pro Ala Thr Pro Phe Val 
690 695 700 

Glu He Thr Lys Phe Pro Ala Val Ser Arg Asp Val Ala Leu Leu Leu 
35 7 05 710 715 720 

Lys Ala Glu Val Thr His Gin Glu Val Val Asp Ala He Gin Ala Ala 
725 730 735 

Gly Val Lys Arg Leu Thr Asp He Lys Leu Phe Asp Val Phe Ser Gly 
40 740 745 750 

Glu Lys Leu Gly Leu Gly Met Lys Ser Met Ala Tyr Ser Leu Thr Phe 
755 760 765 

45 Gin Asn Pro Glu Asp Ser Leu Thr Asp Glu Glu Val Ala Arg Tyr Met 
770 775 780 

Glu Lys He Gin Ala Ser Leu Glu Glu "Lys Val Asn Ala Glu Val Arg 
5o 785 790 795 800 



55 <210> 226 
<211> 180 
<212> PRT 
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<213> Streptococcus pneumoniae 
<400> 226 

Met Leu Glu Asn Asp lie Lys Lys" Val Leu Val Ser His Asp Glu lie 
1 5 io ^ 



15 



Thr Glu Ala Ala Lys Lys Leu Gly Ala Gin Leu Thr Lys Asp Tyr Ala 
20 25 30 

Gly Lys Asn Pro He Leu Val Gly He Leu Lys Gly Ser He Pro Phe 
35 4 o 45 

Met Ala Glu Leu Val Lys His He Asp Thr His He Glu Met Asp Phe 
50 55 60 

Met Met Val Ser Ser Tyr His Gly Gly Thr Ala Ser Ser Gly Val He 
65 7 ° 75 80 

Asn He Lys Gin Asp Val Thr Gin Asp lie Lys Gly Arg His Val Leu 
85 go 95 

Phe Val Glu Asp He He Asp Thr Gly Gin Thr Leu Lys Asn Leu Arg 
100 105 no 

Asp Met Phe Lys Ala Arg Glu Ala Ala Ser Val Lys He Ala Thr Leu 

120 125 

Leu Asp Lys Pro Glu Gly Arg Val Val Glu He Glu Ala Asp Tyr Thr 
130 135 140 

Cys Phe Thr He Pro Asn Glu Phe Val Val Gly Tyr Gly Leu Asp Tvr 
145 • 150 155 ' F 160 

Lys Glu Asn Tyr Arg Asn Leu Pro Tyr He Gly Val Leu Lys Glu Glu 
165 170 175 

Val Tyr Ser Asn 
180 



190 



